(1) GENERAL INFORMATION:

(i) APPLICANT: STUIVER, Maarten Hendrik
CUSTERS, Jerome Humbertina Henricus Victor
SELA-BUURLAGE, Marianne Beatrix
MELCHERS, Leo Sjoerd
VAN DEVENTER-TROOST, Johanna Pieternella
LAGEWEG, Wessel
PONSTEIN, Anne Silene

(ii) TITLE OF INVENTION: ANTIFUNGAL PROTEINS, DNA CODING THEREFOR, AND HOSTS INCORPORATING SAME.

(iii) NUMBER OF SEQUENCES: 75 

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: LADAS & PARRY

(B) STREET: 26 WEST 61 STREET

(C) CITY: NEW YORK

(D) STATE: NY

(E) COUNTRY: USA

(F) ZIP: 10023-7604

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: 3.25" Floppy disk

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: WINDOWS 95 

(D) SOFTWARE: WORDPERFECT 8 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/258,031

(B) FILING DATE: 25-FEB-1999 

(C) CLASSIFICATION: 435 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/EP97/04923

(B) FILING DATE: 04-SEP-1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: EP97200831.2 

(B) FILING DATE: 19-MAR-1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: EP96202466.7 

(B) FILING DATE: 04-SEP-1996 

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single



(iii) HYPOTHETICAL: NO 



(iii) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helianthus annuus 

(B) STRAIN: cv. zebulon

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Ser Ile Asn Val Asp Ile Glu Gln Glu Thr Ala Trp Val Gln Ala Gly
1 5 10 15

Ala Thr Leu Gly Glu Val Tyr Tyr Arg
20 25



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helianthus annuus 

(B) STRAIN: cv. zebulon

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Asp Pro Ser Phe Pro Ile Thr Gly Glu Val Tyr Thr Pro Gly Xaa Ser
1 5 10 15

Ser Phe Pro Thr Val Leu Gln Asn Tyr
20 25



(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: YES 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1 

(D) OTHER INFORMATION: /function 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
AACTTCTCCN AGNGTNGCNC CNGCTTGNAC CCA 33



(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: YES 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /function 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:
GATCCNTCTT TCCCNATTAC TGGNGAGGTT TA 32



(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 354 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helianthus annuus 

(B) STRAIN: cv. zebulon 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..354 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

GAT CCG TCT TTC CCG ATT ACT GGG GAG GTT TAC ACT CCC GGA AAC TCA 48
Asp Pro Ser Phe Pro Ile Thr Gly Glu Val Tyr Thr Pro Gly Asn Ser
1 5 10 15

TCT TTT CCT ACC GTC TTG CAA AAC TAC ATC CGA AAC CTT CGG TTC AAT 96
Ser Phe Pro Thr Val Leu Gln Asn Tyr Ile Arg Asn Leu Arg Phe Asn
20 25 30

GAA ACT ACC ACA CCA AAA CCC TTT TTA ATC ATC ACA GCC GAA CAT GTT 144
Glu Thr Thr Thr Pro Lys Pro Phe Leu Ile Ile Thr Ala Glu His Val
35 40 45

TCC CAC ATT CAG GCA GCT GTG GTT TGT GGC AAA CAA AAC CGG TTG CTA 192
Ser His Ile Gln Ala Ala Val Val Cys Gly Lys Gln Asn Arg Leu Leu
50 55 60

CTG AAA ACC AGA AGC GGT GGT CAT GAT TAT GAA GGT CTT TCC TAC CTT 240
Leu Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly Leu Ser Tyr Leu
65 70 75 80

ACA AAC ACA AAC CAA CCC TTC TTC ATT GTG GAC ATG TTC AAT TTA AGG 288
Thr Asn Thr Asn Gln Pro Phe Phe Ile Val Asp Met Phe Asn Leu Arg
85 90 95

TCC ATA AAC GTA GAT ATC GAA CAA GAA ACC GCA TGG GTC CAA GCC GGC 336
Ser Ile Asn Val Asp Ile Glu Gln Glu Thr Ala Trp Val Gln Ala Gly
100 105 110

GCC ACC CTC GGA GAA GTT 354
Ala Thr Leu Gly Glu Val
115



(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Asp Pro Ser Phe Pro Ile Thr Gly Glu Val Tyr Thr Pro Gly Asn Ser
1 5 10 15

Ser Phe Pro Thr Val Leu Gln Asn Tyr Ile Arg Asn Leu Arg Phe Asn
20 25 30

Glu Thr Thr Thr Pro Lys Pro Phe Leu Ile Ile Thr Ala Glu His Val
35 40 45

Ser His Ile Gln Ala Ala Val Val Cys Gly Lys Gln Asn Arg Leu Leu
50 55 60

Leu Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly Leu Ser Tyr Leu
65 70 75 80

Thr Asn Thr Asn Gln Pro Phe Phe Ile Val Asp Met Phe Asn Leu Arg
85 90 95

Ser Ile Asn Val Asp Ile Glu Gln Glu Thr Ala Trp Val Gln Ala Gly
100 105 110

Ala Thr Leu Gly Glu Val
115



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1 

(D) OTHER INFORMATION: /function= "primer" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
CAGGCAGCTG TGGTTTGTGG C 21 



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1 

(D) OTHER INFORMATION: /function= "primer" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
GTCCACAATG AAGAAGGGTT G 21



(2) INFORMATION FOR SEQ ID NO: 9:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1 

(D) OTHER INFORMATION: /function 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
ACGTAGATAT CGAACAAGAA ACCGC 25



(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
GCTTTACTAC ACGGGCTTCC CCAG 24



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
CTGGGGAAGC CCGTGTAGTA AAGC 24



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
GGTACTCCAA CCACGGCGCT C 21



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:
CGGGAAGTTG CAGAAGATTG GGTTG 25



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:
GAGCAAGAGA AGAAGGAGAC 20



(2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1784 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA to mRNA 



(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helianthus annuus 

(B) STRAIN: Zebulon 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 21..1608



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



ATATCACATC TTCTTTCAAC ATG CAA ACT TCC ATT CTT ACT CTC CTT CTT 50
Met Gln Thr Ser Ile Leu Thr Leu Leu Leu
1 5 10

CTC TTG CTC TCA ACC CAA TCT TCT GCA ACT TCC CGT TCC ATT ACA GAT 98
Leu Leu Leu Ser Thr Gln Ser Ser Ala Thr Ser Arg Ser Ile Thr Asp
15 20 25

CGC TTC ATT CAA TGT TTA CAC GAC CGG GCC GAC CCT TCA TTT CCG ATA 146
Arg Phe Ile Gln Cys Leu His Asp Arg Ala Asp Pro Ser Phe Pro Ile
30 35 40

ACC GGA GAG GTT TAC ACT CCC GGA AAC TCA TCT TTT CCT ACC GTC TTG 194
Thr Gly Glu Val Tyr Thr Pro Gly Asn Ser Ser Phe Pro Thr Val Leu
45 50 55

CAA AAC TAC ATC CGA AAC CTT CGG TTC AAT GAA ACT ACC ACA CCA AAA 242
Gln Asn Tyr Ile Arg Asn Leu Arg Phe Asn Glu Thr Thr Thr Pro Lys
60 65 70



CCC TTT TTA ATC ATC ACA GCC GAA CAT GTT TCC CAC ATT CAG GCA GCT 290
Pro Phe Leu Ile Ile Thr Ala Glu His Val Ser His Ile Gln Ala Ala
75 80 85 90

GTG GTT TGT GGC AAA CAA AAC CGG TTG CTA CTG AAA ACC AGA AGC GGT 338
Val Val Cys Gly Lys Gln Asn Arg Leu Leu Leu Lys Thr Arg Ser Gly
95 100 105

GGT CAT GAT TAT GAA GGT CTT TCC TAC CTT ACA AAC ACA AAC CAA CCC 386
Gly His Asp Tyr Glu Gly Leu Ser Tyr Leu Thr Asn Thr Asn Gln Pro
110 115 120

TTC TTC ATT GTG GAC ATG TTC AAT TTA AGG TCC ATA AAC GTA GAT ATC 434
Phe Phe Ile Val Asp Met Phe Asn Leu Arg Ser Ile Asn Val Asp Ile
125 130 135



GAA CAA GAA ACC GCA TGG GTC CAA GCC GGT GCG ACT CTT GGT GAA GTG 482
Glu Gln Glu Thr Ala Trp Val Gln Ala Gly Ala Thr Leu Gly Glu Val
140 145 150

TAC TAT CGA ATA GCG GAG AAA AGT AAC AAG CAT GGT TTT CCG GCA GGG 530
Tyr Tyr Arg Ile Ala Glu Lys Ser Asn Lys His Gly Phe Pro Ala Gly
155 160 165 170

GTT TGT CCA ACG GTT GGC GTT GGT GGG CAT TTT AGT GGT GGT GGG TAT 578
Val Cys Pro Thr Val Gly Val Gly Gly His Phe Ser Gly Gly Gly Tyr
175 180 185

GGT AAT TTG ATG AGA AAA TAT GGT TTG TCG GTT GAT AAT ATT GTT GAT 626
Gly Asn Leu Met Arg Lys Tyr Gly Leu Ser Val Asp Asn Ile Val Asp
190 195 200

GCT CAA ATA ATA GAT GTG AAT GGC AAG CTT TTG GAT CGA AAG AGT ATG 674
Ala Gln Ile Ile Asp Val Asn Gly Lys Leu Leu Asp Arg Lys Ser Met
205 210 215

GGT GAG GAT TTG TTT TGG GCG ATC ACC GGC GGT GGT GGT GTT AGT TTT 722
Gly Glu Asp Leu Phe Trp Ala Ile Thr Gly Gly Gly Gly Val Ser Phe
220 225 230

GGT GTG GTT CTA GCC TAC AAA ATC AAA CTA GTT CGT GTT CCG GAG GTT 770
Gly Val Val Leu Ala Tyr Lys Ile Lys Leu Val Arg Val Pro Glu Val
235 240 245 250

GTG ACC GTG TTT ACC ATT GAA AGA AGA GAG GAA CAA AAC CTC AGC ACC 818
Val Thr Val Phe Thr Ile Glu Arg Arg Glu Glu Gln Asn Leu Ser Thr
255 260 265

ATC GCG GAA CGA TGG GTA CAA GTT GCT GAT AAG CTA GAT AGA GAT CTT 866
Ile Ala Glu Arg Trp Val Gln Val Ala Asp Lys Leu Asp Arg Asp Leu
270 275 280

TTC CTT CGA ATG ACC TTT AGT GTC ATA AAC GAT ACC AAC GGT GGA AAG 914
Phe Leu Arg Met Thr Phe Ser Val Ile Asn Asp Thr Asn Gly Gly Lys
285 290 295

ACA GTC CGT GCT ATC TTT CCA ACG TTG TAC CTT GGA AAC TCG AGG AAT 962
Thr Val Arg Ala Ile Phe Pro Thr Leu Tyr Leu Gly Asn Ser Arg Asn
300 305 310

CTT GTT ACA CTT TTG AAT AAA GAT TTC CCC GAG TTA GGG TTG CAA GAA 1010
Leu Val Thr Leu Leu Asn Lys Asp Phe Pro Glu Leu Gly Leu Gln Glu
315 320 325 330

TCG GAT TGT ACT GAA ATG AGT TGG GTT GAG TCT GTG CTT TAC TAC ACG 1058
Ser Asp Cys Thr Glu Met Ser Trp Val Glu Ser Val Leu Tyr Tyr Thr
335 340 345

GGC TTC CCC AGT GGT ACT CCA ACC ACG GCG CTC TTA AGC CGT ACT CCT 1106
Gly Phe Pro Ser Gly Thr Pro Thr Thr Ala Leu Leu Ser Arg Thr Pro
350 355 360



CAA AGA CTC AAC CCA TTC AAG ATC AAA TCC GAT TAT GTG CAA AAT CCT 1154
Gln Arg Leu Asn Pro Phe Lys Ile Lys Ser Asp Tyr Val Gln Asn Pro
365 370 375

ATT TCT AAA CGA CAG TTC GAG TTC ATC TTC GAA AGG CTG AAA GAA CTT 1202
Ile Ser Lys Arg Gln Phe Glu Phe Ile Phe Glu Arg Leu Lys Glu Leu
380 385 390

GAA AAC CAA ATG TTG GCT TTC AAC CCA TAT GGT GGT AGA ATG AGT GAA 1250
Glu Asn Gln Met Leu Ala Phe Asn Pro Tyr Gly Gly Arg Met Ser Glu
395 400 405 410

ATA TCC GAA TTC GCA AAG CCT TTC CCA CAT AGA TCG GGT AAC ATA GCG 1298
Ile Ser Glu Phe Ala Lys Pro Phe Pro His Arg Ser Gly Asn Ile Ala
415 420 425

AAA ATT CAA TAC GAA GTA AAC TGG GAG GAT CTT AGC GAT GAA GCC GAA 1346
Lys Ile Gln Tyr Glu Val Asn Trp Glu Asp Leu Ser Asp Glu Ala Glu
430 435 440

AAT CGT TAC TTG AAT TTC ACA AGG CTG ATG TAT GAT TAC ATG ACC CCA 1394
Asn Arg Tyr Leu Asn Phe Thr Arg Leu Met Tyr Asp Tyr Met Thr Pro
445 450 455

TTT GTG TCG AAA AAC CCT AGA AAA GCA TTT TTG AAC TAT AGG GAT TTG 1442
Phe Val Ser Lys Asn Pro Arg Lys Ala Phe Leu Asn Tyr Arg Asp Leu
460 465 470

GAT ATT GGT ATC AAC AGC CAT GGC AGG AAT GCT TAT ACT GAA GGA ATG 1490
Asp Ile Gly Ile Asn Ser His Gly Arg Asn Ala Tyr Thr Glu Gly Met
475 480 485 490

GTT TAT GGG CAC AAG TAT TTC AAA GAG ACA AAT TAC AAG AGG CTA GTA 1538
Val Tyr Gly His Lys Tyr Phe Lys Glu Thr Asn Tyr Lys Arg Leu Val
495 500 505

AGT GTG AAG ACT AAA GTT GAT CCT GAC AAC TTC TTT AGG AAT GAG CAA 1586
Ser Val Lys Thr Lys Val Asp Pro Asp Asn Phe Phe Arg Asn Glu Gln
510 515 520

AGC ATC CCA ACT TTG TCA TCT T GAAGAACGTA CATATATAAA TAAATACCTT 1638
Ser Ile Pro Thr Leu Ser Ser
525

TGTGCATGGT ATTTTCAGGG TGTTAAAGTG ATATTCAGAT ATTTATGATA GAATTTTGAC 1698
TTGTATTTTA TACAATCAAA ATTGTATGGT TCTCCGAATT TCTCTTTTTA ATTCTGAAAA 1758
ATACATATTA GTATTGTCAA AAAAAA 1784



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 529 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Gln Thr Ser Ile Leu Thr Leu Leu Leu Leu Leu Leu Ser Thr Gln
1 5 10 15

Ser Ser Ala Thr Ser Arg Ser Ile Thr Asp Arg Phe Ile Gln Cys Leu
20 25 30

His Asp Arg Ala Asp Pro Ser Phe Pro Ile Thr Gly Glu Val Tyr Thr
35 40 45

Pro Gly Asn Ser Ser Phe Pro Thr Val Leu Gln Asn Tyr Ile Arg Asn
50 55 60

Leu Arg Phe Asn Glu Thr Thr Thr Pro Lys Pro Phe Leu Ile Ile Thr
65 70 75 80

Ala Glu His Val Ser His Ile Gln Ala Ala Val Val Cys Gly Lys Gln
85 90 95

Asn Arg Leu Leu Leu Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly
100 105 110

Leu Ser Tyr Leu Thr Asn Thr Asn Gln Pro Phe Phe Ile Val Asp Met
115 120 125

Phe Asn Leu Arg Ser Ile Asn Val Asp Ile Glu Gln Glu Thr Ala Trp
130 135 140

Val Gln Ala Gly Ala Thr Leu Gly Glu Val Tyr Tyr Arg Ile Ala Glu
145 150 155 160

Lys Ser Asn Lys His Gly Phe Pro Ala Gly Val Cys Pro Thr Val Gly
165 170 175

Val Gly Gly His Phe Ser Gly Gly Gly Tyr Gly Asn Leu Met Arg Lys
180 185 190

Tyr Gly Leu Ser Val Asp Asn Ile Val Asp Ala Gln Ile Ile Asp Val
195 200 205

Asn Gly Lys Leu Leu Asp Arg Lys Ser Met Gly Glu Asp Leu Phe Trp
210 215 220

Ala Ile Thr Gly Gly Gly Gly Val Ser Phe Gly Val Val Leu Ala Tyr
225 230 235 240

Lys Ile Lys Leu Val Arg Val Pro Glu Val Val Thr Val Phe Thr Ile
245 250 255

Glu Arg Arg Glu Glu Gln Asn Leu Ser Thr Ile Ala Glu Arg Trp Val
260 265 270



Gln Val Ala Asp Lys Leu Asp Arg Asp Leu Phe Leu Arg Met Thr Phe
275 280 285

Ser Val Ile Asn Asp Thr Asn Gly Gly Lys Thr Val Arg Ala Ile Phe
290 295 300

Pro Thr Leu Tyr Leu Gly Asn Ser Arg Asn Leu Val Thr Leu Leu Asn
305 310 315 320

Lys Asp Phe Pro Glu Leu Gly Leu Gln Glu Ser Asp Cys Thr Glu Met
325 330 335

Ser Trp Val Glu Ser Val Leu Tyr Tyr Thr Gly Phe Pro Ser Gly Thr
340 345 350

Pro Thr Thr Ala Leu Leu Ser Arg Thr Pro Gln Arg Leu Asn Pro Phe
355 360 365

Lys Ile Lys Ser Asp Tyr Val Gln Asn Pro Ile Ser Lys Arg Gln Phe
370 375 380

Glu Phe Ile Phe Glu Arg Leu Lys Glu Leu Glu Asn Gln Met Leu Ala
385 390 395 400

Phe Asn Pro Tyr Gly Gly Arg Met Ser Glu Ile Ser Glu Phe Ala Lys
405 410 415

Pro Phe Pro His Arg Ser Gly Asn Ile Ala Lys Ile Gln Tyr Glu Val
420 425 430

Asn Trp Glu Asp Leu Ser Asp Glu Ala Glu Asn Arg Tyr Leu Asn Phe
435 440 445

Thr Arg Leu Met Tyr Asp Tyr Met Thr Pro Phe Val Ser Lys Asn Pro
450 455 460

Arg Lys Ala Phe Leu Asn Tyr Arg Asp Leu Asp Ile Gly Ile Asn Ser
465 470 475 480

His Gly Arg Asn Ala Tyr Thr Glu Gly Met Val Tyr Gly His Lys Tyr
485 490 495

Phe Lys Glu Thr Asn Tyr Lys Arg Leu Val Ser Val Lys Thr Lys Val
500 505 510

Asp Pro Asp Asn Phe Phe Arg Asn Glu Gln Ser Ile Pro Thr Leu Ser
515 520 525

Ser



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
CCGCCATGGA GACTTCCATT CTTACTC 27



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GCCGGATCCT CAAGATGACA AAGTTGGGAT GCT 33



(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1589 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helianthus annuus 

(B) STRAIN: Zebulon 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..1590

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



ATG GAG ACT TCC ATT CTT ACT CTC CTT CTT CTC TTG CTC TCA ACC CAA 48
Met Glu Thr Ser Ile Leu Thr Leu Leu Leu Leu Leu Leu Ser Thr Gln
1 5 10 15

TCT TCT GCA ACT TCC CGT TCC ATT ACA GAT CGC TTC ATT CAA TGT TTA 96
Ser Ser Ala Thr Ser Arg Ser Ile Thr Asp Arg Phe Ile Gln Cys Leu
20 25 30

CAC GAC CGG GCC GAC CCT TCA TTT CCG ATA ACC GGA GAG GTT TAC ACT 144
His Asp Arg Ala Asp Pro Ser Phe Pro Ile Thr Gly Glu Val Tyr Thr
35 40 45

CCC GGA AAC TCA TCT TTT CCT ACC GTC TTG CAA AAC TAC ATC CGA AAC 192
Pro Gly Asn Ser Ser Phe Pro Thr Val Leu Gln Asn Tyr Ile Arg Asn
50 55 60

CTT CGG TTC AAT GAA ACT ACC ACA CCA AAA CCC TTT TTA ATC ATC ACA 240
Leu Arg Phe Asn Glu Thr Thr Thr Pro Lys Pro Phe Leu Ile Ile Thr
65 70 75 80

GCC GAA CAT GTT TCC CAC ATT CAG GCA GCT GTG GTT TGT GGC AAA CAA 288
Ala Glu His Val Ser His Ile Gln Ala Ala Val Val Cys Gly Lys Gln
85 90 95

AAC CGG TTG CTA CTG AAA ACC AGA AGC GGT GGT CAT GAT TAT GAA GGT 336
Asn Arg Leu Leu Leu Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly
100 105 110

CTT TCC TAC CTT ACA AAC ACA AAC CAA CCC TTC TTC ATT GTG GAC ATG 384
Leu Ser Tyr Leu Thr Asn Thr Asn Gln Pro Phe Phe Ile Val Asp Met
115 120 125

TTC AAT TTA AGG TCC ATA AAC GTA GAT ATC GAA CAA GAA ACC GCA TGG 432
Phe Asn Leu Arg Ser Ile Asn Val Asp Ile Glu Gln Glu Thr Ala Trp
130 135 140

GTC CAA GCC GGT GCG ACT CTT GGT GAA GTG TAC TAT CGA ATA GCG GAG 480
Val Gln Ala Gly Ala Thr Leu Gly Glu Val Tyr Tyr Arg Ile Ala Glu
145 150 155 160

AAA AGT AAC AAG CAT GGT TTT CCG GCA GGG GTT TGT CCA ACG GTT GGC 528
Lys Ser Asn Lys His Gly Phe Pro Ala Gly Val Cys Pro Thr Val Gly
165 170 175

GTT GGT GGG CAT TTT AGT GGT GGT GGG TAT GGT AAT TTG ATG AGA AAA 576
Val Gly Gly His Phe Ser Gly Gly Gly Tyr Gly Asn Leu Met Arg Lys
180 185 190

TAT GGT TTG TCG GTT GAT AAT ATT GTT GAT GCT CAA ATA ATA GAT GTG 624
Tyr Gly Leu Ser Val Asp Asn Ile Val Asp Ala Gln Ile Ile Asp Val
195 200 205

AAT GGC AAG CTT TTG GAT CGA AAG AGT ATG GGT GAG GAT TTG TTT TGG 672
Asn Gly Lys Leu Leu Asp Arg Lys Ser Met Gly Glu Asp Leu Phe Trp
210 215 220

GCG ATC ACC GGC GGT GGT GGT GTT AGT TTT GGT GTG GTT CTA GCC TAC 720
Ala Ile Thr Gly Gly Gly Gly Val Ser Phe Gly Val Val Leu Ala Tyr
225 230 235 240



AAA ATC AAA CTA GTT CGT GTT CCG GAG GTT GTG ACC GTG TTT ACC ATT 768
Lys Ile Lys Leu Val Arg Val Pro Glu Val Val Thr Val Phe Thr Ile
245 250 255

GAA AGA AGA GAG GAA CAA AAC CTC AGC ACC ATC GCG GAA CGA TGG GTA 816
Glu Arg Arg Glu Glu Gln Asn Leu Ser Thr Ile Ala Glu Arg Trp Val
260 265 270

CAA GTT GCT GAT AAG CTA GAT AGA GAT CTT TTC CTT CGA ATG ACC TTT 864
Gln Val Ala Asp Lys Leu Asp Arg Asp Leu Phe Leu Arg Met Thr Phe
275 280 285

AGT GTC ATA AAC GAT ACC AAC GGT GGA AAG ACA GTC CGT GCT ATC TTT 912
Ser Val Ile Asn Asp Thr Asn Gly Gly Lys Thr Val Arg Ala Ile Phe
290 295 300

CCA ACG TTG TAC CTT GGA AAC TCG AGG AAT CTT GTT ACA CTT TTG AAT 960
Pro Thr Leu Tyr Leu Gly Asn Ser Arg Asn Leu Val Thr Leu Leu Asn
305 310 315 320

AAA GAT TTC CCC GAG TTA GGG TTG CAA GAA TCG GAT TGT ACT GAA ATG 1008
Lys Asp Phe Pro Glu Leu Gly Leu Gln Glu Ser Asp Cys Thr Glu Met
325 330 335

AGT TGG GTT GAG TCT GTG CTT TAC TAC ACG GGC TTC CCC AGT GGT ACT 1056
Ser Trp Val Glu Ser Val Leu Tyr Tyr Thr Gly Phe Pro Ser Gly Thr
340 345 350

CCA ACC ACG GCG CTC TTA AGC CGT ACT CCT CAA AGA CTC AAC CCA TTC 1104
Pro Thr Thr Ala Leu Leu Ser Arg Thr Pro Gln Arg Leu Asn Pro Phe
355 360 365

AAG ATC AAA TCC GAT TAT GTG CAA AAT CCT ATT TCT AAA CGA CAG TTC 1152
Lys Ile Lys Ser Asp Tyr Val Gln Asn Pro Ile Ser Lys Arg Gln Phe
370 375 380

GAG TTC ATC TTC GAA AGG ATG AAA GAA CTT GAA AAC CAA ATG TTG GCG 1200
Glu Phe Ile Phe Glu Arg Met Lys Glu Leu Glu Asn Gln Met Leu Ala
385 390 395 400

TTC AAC CCA TAT GGT GGT AGA ATG AGT GAA ATA TCC GAA TTC GCA AAG 1248
Phe Asn Pro Tyr Gly Gly Arg Met Ser Glu Ile Ser Glu Phe Ala Lys
405 410 415

CCT TTC CCA CAT AGA TCG GGT AAC ATA GCG AAG ATT CAA TAC GAA GTA 1296
Pro Phe Pro His Arg Ser Gly Asn Ile Ala Lys Ile Gln Tyr Glu Val
420 425 430

AAC TGG GAG GAT CTT AGC GAT GAA GCC GAA AAT CGT TAC TTG AAT TTC 1344
Asn Trp Glu Asp Leu Ser Asp Glu Ala Glu Asn Arg Tyr Leu Asn Phe
435 440 445

ACA AGG CTG ATG TAT GAT TAC ATG ACT CCA TTT GTG TCG AAA AAC CCT 1392
Thr Arg Leu Met Tyr Asp Tyr Met Thr Pro Phe Val Ser Lys Asn Pro
450 455 460



AGA GAA GCA TTT TTG AAC TAT AGG GAT TTG GAT ATT GGT ATC AAC AGC 1440
Arg Glu Ala Phe Leu Asn Tyr Arg Asp Leu Asp Ile Gly Ile Asn Ser
465 470 475 480

CAT GGC AGG AAT GCT TAT ACT GAA GGA ATG GTT TAT GGG CAC AAA TAT 1488
His Gly Arg Asn Ala Tyr Thr Glu Gly Met Val Tyr Gly His Lys Tyr
485 490 495

TTC AAA GAG ACA AAT TAC AAG AGG CTA GTA AGT GTG AAG ACT AAA GTT 1536
Phe Lys Glu Thr Asn Tyr Lys Arg Leu Val Ser Val Lys Thr Lys Val
500 505 510

GAT CCT GAC AAC TTC TTT AGG AAT GAG CAA AGC ATC CCA ACT TTG TCA 1584
Asp Pro Asp Asn Phe Phe Arg Asn Glu Gln Ser Ile Pro Thr Leu Ser
515 520 525

TCT TG 1589
Ser
530



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 529 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Met Glu Thr Ser Ile Leu Thr Leu Leu Leu Leu Leu Leu Ser Thr Gln
1 5 10 15

Ser Ser Ala Thr Ser Arg Ser Ile Thr Asp Arg Phe Ile Gln Cys Leu
20 25 30

His Asp Arg Ala Asp Pro Ser Phe Pro Ile Thr Gly Glu Val Tyr Thr
35 40 45

Pro Gly Asn Ser Ser Phe Pro Thr Val Leu Gln Asn Tyr Ile Arg Asn
50 55 60

Leu Arg Phe Asn Glu Thr Thr Thr Pro Lys Pro Phe Leu Ile Ile Thr
65 70 75 80

Ala Glu His Val Ser His Ile Gln Ala Ala Val Val Cys Gly Lys Gln
85 90 95

Asn Arg Leu Leu Leu Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly
100 105 110

Leu Ser Tyr Leu Thr Asn Thr Asn Gln Pro Phe Phe Ile Val Asp Met
115 120 125

Phe Asn Leu Arg Ser Ile Asn Val Asp Ile Glu Gln Glu Thr Ala Trp
130 135 140

Val Gln Ala Gly Ala Thr Leu Gly Glu Val Tyr Tyr Arg Ile Ala Glu
145 150 155 160

Lys Ser Asn Lys His Gly Phe Pro Ala Gly Val Cys Pro Thr Val Gly
165 170 175

Val Gly Gly His Phe Ser Gly Gly Gly Tyr Gly Asn Leu Met Arg Lys
180 185 190

Tyr Gly Leu Ser Val Asp Asn Ile Val Asp Ala Gln Ile Ile Asp Val
195 200 205

Asn Gly Lys Leu Leu Asp Arg Lys Ser Met Gly Glu Asp Leu Phe Trp
210 215 220

Ala Ile Thr Gly Gly Gly Gly Val Ser Phe Gly Val Val Leu Ala Tyr
225 230 235 240

Lys Ile Lys Leu Val Arg Val Pro Glu Val Val Thr Val Phe Thr Ile
245 250 255

Glu Arg Arg Glu Glu Gln Asn Leu Ser Thr Ile Ala Glu Arg Trp Val
260 265 270

Gln Val Ala Asp Lys Leu Asp Arg Asp Leu Phe Leu Arg Met Thr Phe
275 280 285

Ser Val Ile Asn Asp Thr Asn Gly Gly Lys Thr Val Arg Ala Ile Phe
290 295 300

Pro Thr Leu Tyr Leu Gly Asn Ser Arg Asn Leu Val Thr Leu Leu Asn
305 310 315 320

Lys Asp Phe Pro Glu Leu Gly Leu Gln Glu Ser Asp Cys Thr Glu Met
325 330 335

Ser Trp Val Glu Ser Val Leu Tyr Tyr Thr Gly Phe Pro Ser Gly Thr
340 345 350

Pro Thr Thr Ala Leu Leu Ser Arg Thr Pro Gln Arg Leu Asn Pro Phe
355 360 365

Lys Ile Lys Ser Asp Tyr Val Gln Asn Pro Ile Ser Lys Arg Gln Phe
370 375 380

Glu Phe Ile Phe Glu Arg Met Lys Glu Leu Glu Asn Gln Met Leu Ala
385 390 395 400

Phe Asn Pro Tyr Gly Gly Arg Met Ser Glu Ile Ser Glu Phe Ala Lys
405 410 415

Pro Phe Pro His Arg Ser Gly Asn Ile Ala Lys Ile Gln Tyr Glu Val
420 425 430

Asn Trp Glu Asp Leu Ser Asp Glu Ala Glu Asn Arg Tyr Leu Asn Phe
435 440 445



Thr Arg Leu Met Tyr Asp Tyr Met Thr Pro Phe Val Ser Lys Asn Pro
450 455 460

Arg Glu Ala Phe Leu Asn Tyr Arg Asp Leu Asp Ile Gly Ile Asn Ser
465 470 475 480

His Gly Arg Asn Ala Tyr Thr Glu Gly Met Val Tyr Gly His Lys Tyr
485 490 495

Phe Lys Glu Thr Asn Tyr Lys Arg Leu Val Ser Val Lys Thr Lys Val
500 505 510

Asp Pro Asp Asn Phe Phe Arg Asn Glu Gln Ser Ile Pro Thr Leu Ser
515 520 525

Ser



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 350 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 
(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..350

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

GAGAAACTCG GAGACTTTCA CACAATGCCT AACCTCAAAC TCCGACCCCA AACATCCCAT 60 

CTCCCCCGCT ATCTTCTTCT CCGGAAATGG CTCCTACTCC TCCGTATTAC AAGCCAACAT 120

CCGTAACCTC CGCTTCAACA CCACCTCAAC TCCGAAACCC TTCCTCATAA TCGCCGCAAC 180

ACATGAATCC CATGTGCAAG CCGCGATTAC TTGCGGGAAA CGCCACAACC TTCAGATGAA 240 

AATCAGAAGT GGAGGCCACG ACTACGATGG CTTGTCATAC GTTACATACT CTGGCAAACC 300 

GTTCTTCGTC CTCGACATGT TTAACCTCCG TTCGGTGGAT GTCGACGTGG 350 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..278

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

GGCATGGATC TCCGCCGGAG CGACTCTCGG AGAGGTTTAT TATCGGATTT GGGAGAAAAG 60

CAGAGTCCAT GGATTCCCCG CCGGAGTTTG ACCGACGGTT GGTGTTGGTG GGCATTTAAG 120

CGGCGGTGGT TACGGTAACA TGGTGAGGAA GTTTGGATTA TCTGTGGATT ACGTTGAGGA 180

TGCCAAGATC GTCGATGTAA ACNGTCGGGT TTTAGATCGG AAAGCAATGG GTGAGGATCT 240
GTTCTGGGCG ATTACCGGTG GAGGAGGAGG TAGCGTAC 278



(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 345 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 2..345

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
TGGACATATT AGCGGAGGAG GATTCGGTAC AATAATGAGG AAATACGGTT TAGCGTCTGA 60 



TAACGTTGTG GACGCACGTT TGATGGATGT AAATGGGAAA ACTCTTGACC GGAAAACGAT 120
GGGAGAGGAT TTGTTTTGGG CGCTTAGAGG CGGTGGAGCT GCGAGTTTTG GCGTTGTCTT 180
GTCGTGGAAG GTTAAGCTTG CTAGGGTTCC TGAAAAGGTA ACTTGTTTCA TAAGTCAACA 240
TCCGATGGGA CCTAGCATGA ACAAGCTTGT TCATAGATGG CAATCCATAG GATCAAGANN 300
GCTAGACGAA GATTTATTCA TCAGAGTCAA TATTGACAAC AGTCT 345

(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 695 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..695 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

GTTCGTTAAA ACCTATCCTN NANGGGCNAA AGNATATCAA AGNTTGNTTA NGNAACCCAA 60

NATTTCTGAA CTGGCCNCCT TCGGTGGTAT ATGNCNAAAN CCCTTGAATC TGCGNANCCN 120

ATTCCGCATA GAAACGGAAC CCTCTTCAAG ATTCTCTATT TACNCGAACT GNCTAGANNG 180 

AATGACAAGA CATCGAGTAG NAAAATCAAC TGGATCAAAG AGATATACAA TTACATGGCG 240

CCTTATGTCT CAAGCAATCC AAGACAAGCA TATGTGAACT ACAGAGATCT AGACTTCGGA 300 

CAGAACAAGA ACAACGCAT^ GGTTAACTTC ATTGTU^GCTA AAATCTGGGG ACCTAAGTAC 360 

TTCAAAGGCA ATTTTGACAG ATTGGTGAAG ATTAAAACCA AGGTTGATCC AGAGAACTTC 420

TTCAGGCACG AGCAGAGTAT CCCACCTATG CCCTACTAGA AGCTAGGTTC ATGAAACCAA 480 

TAACATTATC A7UVAATAAGR ATAAATGRTA ATTGTATACA ACATGATTCG KCTTTCTTTA 540 

TTTCAGACAA TGTGGACACT ACTCTAAANT AAAAWGTCNA TTTACCTTAA AAATU^AAATA 600 

ATCCCCNNTA ANANAAAANT GGGGGGGCCN TTTTTGGGGN TCCCGGTTTT NGGACGGGGN 660

GCTTTNGGGG GGCTTGGNNT TTTTTTNGGN GCCCC 695



(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 495 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..495

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
TCTGTTTTNA GGCAGAGCAG AGGAAGTTGT TGCTTTGCTT GGTAAGGAGT TTCCTGAATT 60
NAGTTTAAAG AAGGAGAACT GTTCGGAGAT GACTTGGTTT CAGTCAGCTT TATGGTGGGA 120
TAATCGTGTT AACCCTACTC ANATTGATCC WAAAGTGTTT CTCGATCGGA ATCTTGATAG 180
AGCGAATTTC GGAAAGAGGA AATCGGATTA CGTTGCGAGT AAGATTCCTA GAGATGGGAT 240
TAAGYCTTTT TCCAAGARGA TGMCTGACCT GGGGAAAAYC GGGCTTGTTT TTAAWCCGTA 300
TGGTGGGAAA ATGGCGGAGG TTACGGTTAA CGCGACGCCG TTTCCNCACC GAAGCAAGCT 360
TTTTAAGATT CAGTACTCGG TGACTTNGCA AGAAAACTCT NTCGAGATAG AGAAAGGGTT 420
TCTTGAATCA GGCTAACGTC CTTATAGGTT CATGACCGGG TTTTTNAGCA AGANCCCTGG 480
AATNCTTACT TNAAT 495



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 204 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:



(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..204 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
AAATTAAAAC AAATCAATGT TGATATTGAA TCCAATAGTG CTTGGTTTCA ACCTGGTGCT 60
ACGCTTGGTG AGCTTTACTA CAGAATTNCA GAGAAGAGCA AAATCCATGG ATTTCCNGCG 120
GGTTTNTNCA CAAGCNTAGG CATAGGTGGG TATATNANAG GCGGTGGATA CGGTACCTTG 180
ATGAGGAAGT ATGGTCTTNC GGGA 204



(2) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 491 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 2..491

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
GAGATTTCTC GAGCAAGATA CTCCACTGAT GATCTTTGAG CCATTGGGTG GGAAAATCAG 60
CAAGATTTCA GAAACAGAAT CTCCATATCC ACACAGAAGA GGTAATCTGT ATAATATACA 120
GTACATGGTG AAATGGAAAG TGAATGANGT CGAGGAGATG AACAAACATG TCAGGTGGAT 180
GAGATCGTTA CACGATTACA TGACTCCGTA TGTTTCTAAA TCGCCGAGAG GAGCTTATTT 240
GANTTACAGA GATCTTGATT TGGGCTCGAC CAAAGGGATT AACACGGGTT TCGGAGATGC 300
AAGGAAATGG NNGGGTGAGN CTTTTTTCAA AGGTAATTTC CAAGGGGTTA GGTTTTGGTT 360
AAAGGGGAGG TTTNNCCCAN CAAATTTTTT TTCAGGANCC GGCCANGNTT TTCCCCCCCC 420
TNTTTTTNGG NCCCCAATCN AAANCCCCGT TTTAAAAGGG GGGCCATTTC NTTTTTTNCA 480
NNTTAAAAGG G 491



(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 407 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3..407

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

ATTTGTTCGT GAGGTTAACT TTGACTTTAG TCAACGGTAC GAAGCCTGGT GAGAATACGG 60
TTTTAGCGAC TTTCATTGGG ATGTATTTAG GCCGGTCGGA TAAGCTGTTG ACCGTNATGA 120
ACCGGGATTT CCCGGAGTTG AAGCTGAAGA AAACCGATTN TACCGAGATG AGATGGATCG 180
ATTCGGTTCT GTTTTGGGAC GATTATCCGG TTGGTACACC GACTTCTGTG CTACTAAATC 240
CGCTAGTCGC AAAAAAGTTG TTCATGAAAC GAAAATCGGA CTACGTGAAG CGTCTNATTT 300
TCGAGAACCC GATCTCNNGT TTGATACTCA AGAAATTTGT AGAGGTTNNG AAAGTTAAAA 360
TNAATTTGGA TCCGCATTNN GGNANNNATG GTGAAACCCC NNGTTNT 407



(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 360 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 



(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 3..360

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

ACGGCGTCGT ATTGGCCTAC AAAATAAACC TTGTTGAAGT CCCAGAAAAC GTCACCGTTT 60
TCAGAATCTC CCGGACGTTA GAACAAAATG CGACGGATAT CATTCACCGG TGGCAACAAG 120
TTGCACCGAA GCTTCCCGAC GAGCTTTTCA TAAGANCAGT CATTGACGTA NAAACGGCAC 180
TGTTTCATNN CTCAAAAGAC CGTCAGACAA CATTCATAGC AATGTTTCTA GGAGACACGN 240
CAACTCTACT GTCGATATTA AACCGGAGAT TCCCAGAATT GGGTTTGGTC CGGTCTGACT 300
GTACCGNAAC AAGCNNTTGG ATCCAATCTG TGCTATTTTT GGGACAAATA TCCCAGGTTG 360

(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 427 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 3..427

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

TCTTCACTGT CACCAAAACG TTAGAACAAG ACGCAAGATT GAAGACTATT TCTAAGTGGC 60
AACAAATTTC ATCCAAGATT ATTGAAGAGA TACACATCCG AGTGGTACTC AGAGCAGCTG 120
GAAATGATGG AAACAAGACT GTGACAATGA CCTACCTAGG TCAGTTTCTT GGCGAGAAAG 180
GCACCTTGCT GAAGGTTATG GAGAAGGCTT TTCCAGAACT AGGGTTAACT CAAAAGGATT 240
GTACTGAAAT GAGCTGGATT GAAGCCGCCC TTTTCCATGG TGGRTTTCCA ACAGGKTCTC 300
CTATTGAAAT TTTGCTTMAG CTCAAGTCGC CTYTAGGAAA AGRTTWCTTC AAAGCAACGK 360
CGGATTTCGT TAAAGAACCT WTTCCTGTGA TAGGGCTCAA AGGAATATTC AAAAGATTGA 420
TTGAAGG 427



(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 437 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..437 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

GTTGTACTAT CATNGAAGAT TAAGTTAGTC GATGTTCCGT CCACGGTCAC CGNGTTTAAA 60
GTCCAGAAAC ATNAGGAGAA AGAGGCCGTT AGGNTCATCA ACAAGTGGCA GTATGTTGCG 120
GATAAGGTCC CTGAAGATCT TTTCATCAGC GCAACGTTGG NGAGATCAAA CGGAAACTCT 180
GTGCAGGCTT TGTTTACTGG ACTCTATCTT GGNCCGGTGA ATAATNTCTT GGCCTTGATG 240
GAAGAT^AAGT TTCCAGANTT AGGTCTTGAT ATCCAAGNCT GCACAGAGAT GAGTTGGGCT 300
GAATCTGCAC TCTGGTNTNC TGNTTTCNCT AAAGGAGAGN CTCCTTGGGT GTTCCNCGCG 360
GATCGGNAGC GGNCAATTTN TGGNCTTTCA AGGGGAAAGN CGGCTTTTTN CAAGT^CCCG 420
NTACCCGGGG TTCAATT 437



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 



(A) ORGANISM: Arabidopsis thaliana 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..441 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

GCGGACCCTA TAGATCANNA TGTGCTACTG ANAGAAGAGG AAGCCAAGAA CAAGCCGGAG 60
ACAGATAAAT ATCTGAAATG GGNCGATANC GTTTACGAAT TTATGACNCC ATATGTTTCG 120
AAATCTCCAA GAGGAGCTTA TGTCAATTTC AAGGATATGG ATTTGGGTAT GTATCTTGGA 180
AAGAAGAAGA CAAAGTACGA GGAAGGAAAG AGTTGGGGAG TGAAGTATTT CAAGAACAAT 240
TTCGAGAGAT TGGTGAGAGT GAAGACTAGG GTTGATCCAA CAGATTTCTT CTGCGATGAA 300
CAGAGCATTC CTCTGGTGAA CAAAGTTACC TGAAGATATC ATTTGAAGTT TTTTATTAGT 360
CCCTTTTCTC TGTGAAATCA TCTGTGCGTG TTGAATATTA TGCGTCAAGT GTGTAACTTA 420
TGTGTGTGAT TGTGAATTGT G 441

(2) INFORMATION FOR SEQ ID NO: 33:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 502 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..502

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

CTGGCTTAAC ACAACGTCGT TTTGGGCCAA TTACCCGGCG GGTACACCCA AGAGCATCCT 60
TCTAGATAGG CCTCCGACGA ATTCAGTGTC ATTTAAGAGT AAATCGGATT TTGTCAAAAA 120
ACCAATACCC AAAAAAGGTT TAGAGAAGCT TTGGAAGACA ATGTTTAAAT TCAACAGTAG 180
CGTCTCGTTG CAATTCAACC CTTACGGTGG AGTGATGGAC CGGATTCCGG CAACGGCCAC 240
CGCTTTTCCT CATCGGAAAG GAAACTTGTT CAAGGTTCAA TACNCTACGA TGTGGTTTGA 300
CGCAAACGCC ACACAGAGTA GCCNGGCTAT GATGAATGAG CTTTTTGAGG TGGCGGGACC 360
GTACGTGNGT CAAGTAAACC CGAGANANGG CTTCCTTTAA NTTCAGAGNC CATCGNTNTT 420
NGGAGCAANN CCAAGTGGGG GGGNCCAACC GGGGGNTNAA ANCNNAGNTC TTNGGGGGCC 480
CAGAATTTCC TTNGGGGAAT TT 502



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..400

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

NGGGAATTGC NCGAGGNAAG TTGTACCCAA TTCCTGGACC ACCATTGGTT TCCCAAGAAN 60
CCCGAGACAA CCGTTTTTCA ATNACCGTGA TGTTGATTTG GGTATTAATT CTCATAATGG 120
TAAAATCAGT AGTTATGTGG AAGGTAAACG TTACGGGAAG AAGTATTTCG CAGGTAATTT 180
CGAGAGATTG GTGAAGATTA AGACGAGAGT TGATAGTGGT AATTTCTTTA GGAACGAACA 240
GAGTATTCCT GTGTTACCAT AAGTGTATTT ATTTGATTAT TGGTTAGTGA AATTTGTTGT 300
TGTATAATGA TTATATGTCG TATTTTTATT TATTATTAGT AATTTATAAA GTTTGATATT 360
AAATACAAAT AGTATAATAA GATAGTTTCT TTTAGTAAAA 400



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 383 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: cDNA to mRNA 



(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..383

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
CAACTCTAAT GGGAACACCT ACTTCGATCG AATGTCGATG GGGGAAGAGC TTTTCTGGGC 60
GGTTCGAGGA GGTGGAGCCG CGAGTTTCGG CATCGTGATG GGATACAAAA TCCGGTTGGT 120
TCCGGTTCCG GAGAAAGTTA CGGTTTTTAG CGTCGGAAAA ACCGTCGGAG AAGGAGCCGT 180
TGATCTTATA ATGAAGTGGC AGAACTTCTC TCATAGTACG GNTCGGAATT TNTTTGTGAA 240
GCTGANTTTT GANTTTAGTC AACGGTGCAA AGCCGGGTGA AAAAAAGGTT TTAGNGNCTT 300
TCANTTTGGN TGNAANCTTG GGGGTTTTAT NAGAACGGTT AACCGGGATT NANCCCGNGT 360
TTTCCCGGGG TTAAAACCTT NGG 383



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 354 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..354 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
ATCAATGTTC TTACTAAACG TACACGAGCA TCGTTGGCTT TCAAGGCTAA ATCTGATTTT 60
NTTCAAGAAC CGATNCCTAA AACCGCGATT TCGAAGCTTT GGAGACGGTT GCAAGAACCG 120
GAAGCAGAGC ATGCTCAGCT AATTTNCACN CCATTTGGTG GTAAAATGAG TNAGATTGCA 180
GATTACGAAA CACCATTTCC GCATAGGAAG GGGAATATAT ATNAGATTCA GTACTTGAAT 240
TACTGGAGAG GAGACGTGAA AGAGTU^GTAT ATTGAGATNG GTGGAGGAGA GTTTACGGTT 300
GNTATNAGTA AGTTTTTTGG CGAAGTNTNC CNAGAGGNGN CTTNNTNTAA ACCT 354



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 403 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..403

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:
TTTTTTAGTA CACTAATAAT CAAATGGAAT GAGAAATGAA GCCACAAAAG TATCTGCAAT 60
CAAAATATCC TGCTATCTCC ATCTCAAGCT CTCAATAGTA TCCTCTCCGA AAGTGAAATC 120
AACATTTCAA ACTCTATTTC TTGGTGGAAT CGATAGACTG ATTCCTCTGA TGAACCAGAA 180
GTTTCCGGAA CTCGGCTTAC GATCTCAAGA CTGTTCGGAA ATGAGCTGGA TCGAATCGAT 240
AATGTTCTTC AACTGGAGAT CAGGACAGCC GTTAGAGATT TTGCTCAACA GAGACCTAAG 300
GATTCGAGGA TCAGTATTTC AAAGCAT^GT CAGGATTATG GTTCAAAAAC CCGTTCCTGA 360
AAACGTTTTT CGAAGAGGTA TCCAAGGGGT TTCTCGAGCA AGT 403



(2) INFORMATION FOR SEQ ID NO: 38:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 260 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA



(iii) HYPOTHETICAL: NO 



(iii) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

GAGATGAGTT GGATTAANTC TGTACTCTGG TTTGCTGATT TCCCTAAAGG AGAATCTCTT 60
NGTGTTCTCA CGAATCGTAA GCGTACATCT CTATCTTTNA AAGGCAAAGA TGATTTTATC 120
CAAGAACCGA TACCCGAGGC TGCAATTNAA GAGATATGGA GGCGATTAGA AGCCCCCNAG 180
GCTCGGCTTG GAAAGATCAT ATTAACTCCA TTTGGTGGGA AAATNAGTGA AATGGCAGAG 240
TACGTANCAC CATTCCCACA 260



(2) INFORMATION FOR SEQ ID NO: 39:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 605 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..605

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
CTCTTGCATA TTCGCTGCAA GGATGGGAAA TTCAAAACCA CTCCCTACAA TTTTTTGTAT 60
TATAGTTTCA GTCTTGTATT TTTAATTCTA TTGCATAACA CCAACTTCTT CATCAGCCTC 120
CATCCAAGAT CAATTCATAA ACTGTGTCAA AAGAAACACA CATGTTTCTT TTCCACTCGA 180
GAAAACGTTA TTCACCCCTG CGAAAAACGT CTCTTTGTTC AACCAAGTCC TTGANTCGAC 240
GGCTCAAAAT CTCCAGTTCT TGGCAAAATC CATGCCTAAA CCGGGRTTCA TATTCAGACC 300
GATTCACCAG TCTCAAGTCC AAGSTTCCAT CATTTGTTCA AMGRAACTCG GGNTTCATTT 360
TNGTGTTTGA NGTGGCGGTC ACGATTTTCG AGGCCTTTGT NTTTATGTTT CACGGTTTGA 420
AAAAACCGTT TATATTACTC GGCCTGTCAA ANTTGNANNC AAAATCANAT GTTGGATATT 480
GNATTCCAAA TAGGTNCTTG GGGTNAACCT GGTGGCTANC GTTTGGTGAG CTTTTACTTT 540
CAAGAATTTG CANGNGGANG TGCAAAGATT CCATGGGATT TCCCGGGGGG TTTNTTGCAC 600
AATGT 605



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 464 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..464

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

AACACAAAAC TCTTCCATTT GGCTTCTCTC TTGCATATTC GTTGCAAGGA TGGGAAATTC 60
AAAACCACTC CCTACAATTN CTTGTATTAT CGTTTCAGTC TTGTATTTTN NATTCTATTG 120
CATAACACCA ACTTCTTCAT CAGCCTCCAT CCAAGNTCAA TTCATAAACT GTGTCAAAAG 180
GAACACACAT GTTTCTTTTC CACTCGAGNA AACGGTATTC ACTCCTGCGG AAAACGGCTC 240
TNTTATTCAA CGGGTCCNTG AATCGACGGG TCAAAATCTC CAGTTCTTGG NAAAATCCAT 300
GNCTAAACCG GGGTTCATAT TCAGGCCGGT TCACCAGTCT CAAGTCCAAG NTTCCATCAT 360
TTGTTCAAAG GAACTCGGGA TTCATTTCCG CGNTAGAAGT GGCGGGCANN GGTTTCGGGG 420
CCTGTCTNTT GNTTANGGGN AGGAAAACCG GTTNTATTNC TCGG 464



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 386 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 



(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA to mRNA
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..386 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41:

TCGGGAGCCC ANGNTAAATT ANNTGAAAAT GGGGNCGNAT ANCCGTTTAC NGAATTTTAT 60
GACNCCCAAT ATGTTTCGAA ATCTCAAAGA NNGGGANCTT ATGTCAATTT CAAGGATATG 120
GATTTGGGTA TGTATCTTGG AAAGNAGAAG ACAAAGTACG AGGAAGGAAA GAGTTGGGGA 180
GTGAAGTATT TCAAGAACAA TTTCGAGAGA TTGGTGAGAG TGAAGACTAG GGTTGATCCN 240
ACAGATTTCN TCTGCGATGA ACAGAGCATT CCTCTGGTGN ACAAAGTTAC CTGAAGATAT 300
CATTTGAAGT TTTTTATTAG TCCCTTTTCT CTGTGAAATC ATCTGTGCGT GTTGAATANT 360
ATGCGTCAAG TGTGTAACTT ATGTGT 386



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 377 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE:

(A) NAME/KEY: CDS 

(B) LOCATION: 1..377 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
TACCATAGGG AGGTGGTGNA AGATTTTGTA TGTAGNCTTA GGGGAAGGCG AGTAGTATGG 60
TGGTGGTGGG GAGCTGTAAA CGTATGGTGG TGGTGGAGAT TTGTATGTGG GCTGGTTAAC 120
TTCATTGAAG CTAAAATCTG GGGACCTAAG TACTTCAAAG GCAATTTTGA CAGATTGGTG 180
AAGATTAAAA CCAAGGTTGA TCCAGAGAAC TTCTTCAGGC ACGAGCAGAG TATCCCACCT 240
ATGCCCTACT AGAAGCTAGG TTCATGAAAC CAATAACATT ATCAAAAATA AGAATAAATG 300
ATAATTGTAT ACAACATGAT TCGTCTTTCT TTATTTCAGA CAATGTGGAC ACTACTCTAA 360
ATAAAATGTC ATTTACC 377



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 377 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..377

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
TACCATAGGG AGGTGGTGNA AGATTTTGTA TGTAGNCTTA GGGGAAGGCG AGTAGTATGG 60
TGGTGGTGGG GAGCTGTAAA CGTATGGTGG TGGTGGAGAT TTGTATGTGG GCTGGTTAAC 120
TTCATTGAAG CTAAAATCTG GGGACCTAAG TACTTCAAAG GCAATTTTGA CAGATTGGTG 180
AAGATTAAAA CCAAGGTTGA TCCAGAGAAC TTCTTCAGGC ACGAGCAGAG TATCCCACCT 240
ATGCCCTACT AGAAGCTAGG TTCATGAAAC CAATAACATT ATCAAAAATA AGAATAAATG 300
ATAATTGTAT ACAACATGAT TCGTCTTTCT TTATTTCAGA CAATGTGGAC ACTACTCTAA 360
ATAAAATGTC ATTTACC 377



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 346 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double




(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..346

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

GAGCTGTGGA TATGGTCACA AATGGCAATC GGTTGGTCCG AAAACTGATC CGAATCTTTT 60
TATGAGAATN TTGATTCAAC CAGTGACGAG GAAGAAGGTA AAGACTGTGA GAGCTTCTNT 120
GGTTGCCCTN TTTTNAGGCN AGACAGATGA AGTTTTTGCT TTCCTTAGTA AGGAGTTTCC 180
TGAATTGGGT TTAAAGAAGG AGAATTNTTC GGAGATGACT TGGTTTCANT CTGCTTTATG 240
GTGGGACAAT CGTCTTAATG CTACTCAGGT TGATCCTAAA GTNTTTCTTG ATCGGAATCT 300
CGATACCTCG AGTTTCGGTA AGAGGAAATC GGATTACGTC GCGACT 346



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 261 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 2..261

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
ATGGGGTGAG ACTTATTTCA AAGGTAATTT CAAGAGATTA GGTTTGGTTA AAGGGAAGNT 60
TGATCCAACA AATTTCTTCA GGAACGAACA GAGTATTCCT CCTCTGTTTT GAGTCCTCAA 120
TACAAAACCA GATATAAAAG ATGTCATTTC ATTTTTTCAA TTATAATAGA TAATGTAACT 180
TTCTGCTACA ATTGTAAAAG TGAGATGTAC CCAATACGGT TTAAGCGGAC CGAGAATAGT 240
CAATTCAAAG ACCAAATTCT G 261



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 478 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..478 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

GCTCAAAGGA CTAACCATGA AAACTTCCTC AAGTGTCTCT CTCACCGANT CAACGAGGAC 60
GACTCAAGAN TTATACACAC ATCAAAAGAT CCTTCGTATT TNTCAATCTT GATTTCTTCC 120
ATACAAAATC CAAGTTTCTC TGTTCTCGAA ACACCTAAAC CGGTTTCAAT CATCACTCCG 180
GTTCAAGCCA CCGATGTTCA ATCTACGNTT AAATNCGCAC GGNCTTCACG GGTATACACA 240
ATCAGGGCTA GGAGTGGTNG TCATGACTAC GGAGGTTTAT CTTTACATTG GCTTAAAAAN 300
CANNCCGTTC GTTNNTCATT GATTTNNAGA AATCTTCCGG GCTTATTTAA CATNTAAGAT 360
GTTTGATAAN CCGGNNCCNG TTTGGGGTTC AAATCCCGGT GGCTTACTUU^ NTTNGGGGGA 420
ATTGTNCCTA TGAGGTTTGG AAAATTAANG CAAAATNTTT TGGGCCTTCC CGGCCGGT 478



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 579 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA to mRNA 



(iii) HYPOTHETICAL: NO



(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..579

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

GGCCGTTAGG ATCATCAAGA AATGGCAATA TGCTGCAGAT AAGGTTCCTG ATGATCTTTT 60
CATTAGGACA ACATTGGAGA GATCAAACAA GAACGCAGTA CACGCTTTGT TCACTGGACT 120
ATATATTGGT CCGGTGAACA ATCTATTGGC GTTGATGGAA GAAAAGTTTC CGGAACTAGG 180
TCTTGAGAAA GAAGGTTGTG AAGAGATGAG TTGGATTGAG TCTGTACTCT GGTTTGCTGA 240
TTTCCCTAAA GGAGAATCTC TTGGTGTTCT CACGAATCGT GAGCGTACAT CTCTATCTTT 300
CAAAGGCAAA GATGATTTTG TCCAAGAACC GATACCCGAG GCTGCAATTC AAGAGATATG 360
GAGGCGATTA GAAGCCCCCG AGGCTCGGCT TGGAAAGATC ATATTAACTC CATTTGGGTG 420
NGGNAAAATG AGTGAAATGG CAGAGNCCGA ACCACCAATT CCCACANNCG AGGGAGGGGA 480
ACCCCTNTGN GGNTCAGAAT GTGGTTCCTG GNNNNNAAGN GGGNGCCAGN ACCAANCCGG 540
GNCNGTAAAN CNTGNAATGG GCCNAACCCG TNCCGGATT 579



(2) INFORMATION FOR SEQ ID NO: 48:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Oryza sativa

(B) STRAIN: Nipponbare, subsp. japonica

(D) DEVELOPMENTAL STAGE: etiolated shoot (8 days old) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3..252



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 



TGTCCTGGAA GGTCCGCCTC GTGCAGGTTN CGACGACGGT GACGGTGTTC GTCGTCGGGA 60
GGAACGTCGA CCAGGGCGCC GCNGACGTCG TCGCCAGATG GCAAGACGTC GCGCCGAGCC 120
TCCCTCCCGA GCTCACCATA CGGGTGATCG TNCGAGGGCA GCGCGCCACG TTCCAGTCGC 180
TGTACCTCGG CTCGTGCGCC GACCTGGTGC CGACGATGAG CAGCATGTTC CCGGAGCTCG 240
GGATGACGAT TG 252



(2) INFORMATION FOR SEQ ID NO: 49:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactuca sativa 

(B) STRAIN: lollo bionda 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 12

(D) OTHER INFORMATION: /label= Ambiguous
/note= "Xaa = Cys or Ser"

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 20..21

(D) OTHER INFORMATION: /label= ambiguous
/note= "Xaa-Xaa probably is Ser-Phe"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

Thr Ser Thr Ser Ile Ile Asp Arg Phe Thr Gln Xaa Leu Asn Asn Arg
1 5 10 15

Ala Asp Pro Xaa Xaa
20



(2) INFORMATION FOR SEQ ID NO: 50:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein



(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactuca sativa 

(B) STRAIN: lollo bionda 



(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 1

(D) OTHER INFORMATION: /label= ambiguous
/note= "Xaa = probably Ser"

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 3

(D) OTHER INFORMATION: /label= unknown

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 5

(D) OTHER INFORMATION: /label= ambiguous
/note= "Xaa = probably Ser"

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 12

(D) OTHER INFORMATION: /label= ambiguous
/note= "Xaa = probably Trp"

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 24

(D) OTHER INFORMATION: /label= ambiguous
/note= "Xaa = probably Tyr"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 



Xaa Ile Xaa Val Xaa Ile Glu Asp Glu Thr Ala Xaa Val Gln Ala Gly
1 5 10 15

Ala Thr Leu Gly Glu Val Tyr Xaa
20



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactuca sativa 

(B) STRAIN: lollo bionda 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Ala Asp Pro Ser Phe Pro Leu Ser Gly Gln Leu Tyr Tyr Pro
1 5 10



(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
ACTTCTACTT CTATTATTGA TAGGTTTACT CA 32 

(2) INFORMATION FOR SEQ ID NO: 53:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 405 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactuca sativa 

(B) STRAIN: lollo bionda 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..405 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

ACT TCT ACT TCT ATT ATT GAT AGG TTT ACT CAA TGT CTA AAC AAC CGA 48
Thr Ser Thr Ser Ile Ile Asp Arg Phe Thr Gln Cys Leu Asn Asn Arg
1 5 10 15

GCT GAC CCT TCT TTC CCG CTC AGT GGA CAA CTT TAC ACT CCC GAT AAC 96
Ala Asp Pro Ser Phe Pro Leu Ser Gly Gln Leu Tyr Thr Pro Asp Asn
20 25 30

TCC TCT TTT CCA TCC GTC TTG CAA GCT TAC ATC CGG AAC CTC CGA TTC 144
Ser Ser Phe Pro Ser Val Leu Gln Ala Tyr Ile Arg Asn Leu Arg Phe
35 40 45

AAT GAA TCC ACG ACT CCC AAA CCC ATC TTA ATC ATC ACC GCC TTA CAC 192
Asn Glu Ser Thr Thr Pro Lys Pro Ile Leu Ile Ile Thr Ala Leu His
50 55 60

CCT TCA CAC ATT CAA GCA GCT GTT GTG TGC GCC AAA ACA CAC CGC CTG 240
Pro Ser His Ile Gln Ala Ala Val Val Cys Ala Lys Thr His Arg Leu
65 70 75 80

CTA ATG AAA ACC AGA AGC GGA GGC CAT GAT TAT GAG GGG CTT TCC TAT 288
Leu Met Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly Leu Ser Tyr
85 90 95

GTG ACC AAT TCG AAC CAA CCC TTT TTT GTT GTT GAC ATG TTC AAC TTA 336
Val Thr Asn Ser Asn Gln Pro Phe Phe Val Val Asp Met Phe Asn Leu
100 105 110

CGC TCC ATA AAC GTG AGT ATT GAA GAT GAA ACT GCA TGG GTC CAA GCC 384
Arg Ser Ile Asn Val Ser Ile Glu Asp Glu Thr Ala Trp Val Gln Ala
115 120 125

GGC GCC ACC CTC GGA GAA GTT 405
Gly Ala Thr Leu Gly Glu Val
130 135



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 135 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Thr Ser Thr Ser Ile Ile Asp Arg Phe Thr Gln Cys Leu Asn Asn Arg
1 5 10 15

Ala Asp Pro Ser Phe Pro Leu Ser Gly Gln Leu Tyr Thr Pro Asp Asn
20 25 30

Ser Ser Phe Pro Ser Val Leu Gln Ala Tyr Ile Arg Asn Leu Arg Phe
35 40 45

Asn Glu Ser Thr Thr Pro Lys Pro Ile Leu Ile Ile Thr Ala Leu His
50 55 60

Pro Ser His Ile Gln Ala Ala Val Val Cys Ala Lys Thr His Arg Leu
65 70 75 80

Leu Met Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly Leu Ser Tyr
85 90 95

Val Thr Asn Ser Asn Gln Pro Phe Phe Val Val Asp Met Phe Asn Leu
100 105 110

Arg Ser Ile Asn Val Ser Ile Glu Asp Glu Thr Ala Trp Val Gln Ala
115 120 125

Gly Ala Thr Leu Gly Glu Val
130 135



(2) INFORMATION FOR SEQ ID NO: 55:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
CACGTTTATG GAGCGTAAGT TGAAC 25



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
CACCCTTCAC ACATTCAAGC AGC 23



(2) INFORMATION FOR SEQ ID NO: 57:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1981 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactuca sativa 

(B) STRAIN: lollo bionda 



(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 7..1626

(ix) FEATURE:

(A) NAME/KEY: unsure

(B) LOCATION: replace(372, "g")

(ix) FEATURE:

(A) NAME/KEY: unsure

(B) LOCATION: replace(379, "g")

(ix) FEATURE:

(A) NAME/KEY: unsure

(B) LOCATION: replace(786, "t")

(ix) FEATURE:

(A) NAME/KEY: unsure

(B) LOCATION: replace(1105..1106, "ga")

(D) OTHER INFORMATION: /note= "also possible "gg" and "aa""



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

ACAAAA ATG GCA ATT ACC TAT TCT TTC AAC TTC AAA TCT TAT ATT TTT 48
Met Ala Ile Thr Tyr Ser Phe Asn Phe Lys Ser Tyr Ile Phe
1 5 10

CCT CTC CTC CTT GTC TTG CTC TCT ACC CAT TCA TCA GCG ACT TCA ACT 96
Pro Leu Leu Leu Val Leu Leu Ser Thr His Ser Ser Ala Thr Ser Thr
15 20 25 30

TCC ATT ATA GAT CGC TTC ACC CAA TGT CTA AAC AAC CGA GCT GAC CCT 144
Ser Ile Ile Asp Arg Phe Thr Gln Cys Leu Asn Asn Arg Ala Asp Pro
35 40 45

TCT TTC CCG CTC AGT GGA CAA CTT TAC ACT CCC GAT AAC TCC TCT TTT 192
Ser Phe Pro Leu Ser Gly Gln Leu Tyr Thr Pro Asp Asn Ser Ser Phe
50 55 60

CCA TCC GTC TTG CAA GCT TAC ATC CGG AAC CTC CGA TTC AAT GAA TCC 240
Pro Ser Val Leu Gln Ala Tyr Ile Arg Asn Leu Arg Phe Asn Glu Ser
65 70 75

ACG ACT CCC AAA CCC ATC TTA ATC ATC ACC GCC TTA CAC CCT TCA CAC 288
Thr Thr Pro Lys Pro Ile Leu Ile Ile Thr Ala Leu His Pro Ser His
80 85 90

ATT CAA GCA GCT GTT GTG TGC GCC AAA ACA CAC CGC CTG CTA ATG AAA 336
Ile Gln Ala Ala Val Val Cys Ala Lys Thr His Arg Leu Leu Met Lys
95 100 105 110

ACC AGA AGC GGA GGC CAT GAT TAT GAG GGG CTT TCC TAT GTG ACC AAT 384
Thr Arg Ser Gly Gly His Asp Tyr Glu Gly Leu Ser Tyr Val Thr Asn
115 120 125



TCG AAC CAA CCC TTT TTT GTT GTT GAC ATG TTC AAC TTA CGC TCC ATA 432
Ser Asn Gln Pro Phe Phe Val Val Asp Met Phe Asn Leu Arg Ser Ile
130 135 140

AAC GTG AGT ATT GAA GAT GAA ACT GCA TGG GTC CAA GCT GGT GCG ACT 480
Asn Val Ser Ile Glu Asp Glu Thr Ala Trp Val Gln Ala Gly Ala Thr
145 150 155

CTT GGT GAA GTC TAC TAC CGA ATA GCA GAG AAA AGC AAC AGT CAT GCT 528
Leu Gly Glu Val Tyr Tyr Arg Ile Ala Glu Lys Ser Asn Ser His Ala
160 165 170

TTT CCG GCT GGC GTT TGC CCT ACT GTT GGA GTT GGT GGC CAT TTT AGT 576
Phe Pro Ala Gly Val Cys Pro Thr Val Gly Val Gly Gly His Phe Ser
175 180 185 190

GGT GGT GGT TAT GGT AAC TTG ATG GGA AAA TAC GGC CTT TCT GTT GAC 624
Gly Gly Gly Tyr Gly Asn Leu Met Gly Lys Tyr Gly Leu Ser Val Asp
195 200 205

AAT ATT GTC GAT GCT CAG TTA ATC GAT GTG AAT GGT AAA CTT CTG AAT 672
Asn Ile Val Asp Ala Gln Leu Ile Asp Val Asn Gly Lys Leu Leu Asn
210 215 220

CGG AAA TCA ATG GGT GAA GAT CTT TTT TGG GCC ATC ACA GGT GGT GGT 720
Arg Lys Ser Met Gly Glu Asp Leu Phe Trp Ala Ile Thr Gly Gly Gly
225 230 235

GGT GTC AGC TTT GGT GTG GTT GTA GCG TAC AAG ATC AAA CTG GTT CGT 768
Gly Val Ser Phe Gly Val Val Val Ala Tyr Lys Ile Lys Leu Val Arg
240 245 250

GTT CCT ACC ACT GTG ACC GTT TTT AAC GTA CAA AGA ACA TCC GAG CAG 816
Val Pro Thr Thr Val Thr Val Phe Asn Val Gln Arg Thr Ser Glu Gln
255 260 265 270

AAC CTA AGC ACC ATA GCC CAC CGA TGG ATA CAA GTT GCG GAT AAG CTC 864
Asn Leu Ser Thr Ile Ala His Arg Trp Ile Gln Val Ala Asp Lys Leu
275 280 285

GAT AAT GAC CTT TTC CTT CGA ATG ACC TTT AAC GTG ATA AAC AAC ACA 912
Asp Asn Asp Leu Phe Leu Arg Met Thr Phe Asn Val Ile Asn Asn Thr
290 295 300

AAT GGC GAA AAG ACG ATA CGT GGT TTG TTT CCA ACA CTG TAC CTC GGA 960
Asn Gly Glu Lys Thr Ile Arg Gly Leu Phe Pro Thr Leu Tyr Leu Gly
305 310 315

AAC TCT ACC GCT CTT GTT GCC CTC CTG AAC AAG GAT TTC CCT GAA TTA 1008
Asn Ser Thr Ala Leu Val Ala Leu Leu Asn Lys Asp Phe Pro Glu Leu
320 325 330

GGT GTA GAA ATT TCA GAT TGT ATT GAA ATG AGT TGG ATC GAG TCT GTT 1056
Gly Val Glu Ile Ser Asp Cys Ile Glu Met Ser Trp Ile Glu Ser Val
335 340 345 350



CTT TTC TAC ACA AAC TTC CCC ATT GGT ACT CCG ACC ACT GCT CTT CTA 1104
Leu Phe Tyr Thr Asn Phe Pro Ile Gly Thr Pro Thr Thr Ala Leu Leu
355 360 365



AGC CGT ACA CCT CAA AGA CTA AAC CCA TTC AAA ATC AAA TCT GAT TAC 1152
Ser Arg Thr Pro Gln Arg Leu Asn Pro Phe Lys Ile Lys Ser Asp Tyr
370 375 380

GTA AAA AAC ACT ATT TCC AAA CAG GGA TTC GAA TCC ATA TTT GAA AGG 1200
Val Lys Asn Thr Ile Ser Lys Gln Gly Phe Glu Ser Ile Phe Glu Arg
385 390 395

ATG AAA GAA CTC GAA AAC CAA ATG CTA GCT TTC AAC CCT TAT GGT GGA 1248
Met Lys Glu Leu Glu Asn Gln Met Leu Ala Phe Asn Pro Tyr Gly Gly
400 405 410

AGA ATG AGC GAA ATT TCC GAA TTT GCA AAG CCT TTT CCC CAT CGA TCA 1296
Arg Met Ser Glu Ile Ser Glu Phe Ala Lys Pro Phe Pro His Arg Ser
415 420 425 430

GGG AAT ATA GCG AAG ATC CAA TAC GAA GTA AAC TGG GAT GAA CTT GGC 1344
Gly Asn Ile Ala Lys Ile Gln Tyr Glu Val Asn Trp Asp Glu Leu Gly
435 440 445

GTT GAA GCA GCC AAT CGG TAC TTG AAC TTC ACA AGG GTG ATG TAT GAT 1392
Val Glu Ala Ala Asn Arg Tyr Leu Asn Phe Thr Arg Val Met Tyr Asp
450 455 460

TAT ATG ACT CCG TTT GTT TCT AAG AAC CCC AGG GAA GCA TTT CTG AAC 1440
Tyr Met Thr Pro Phe Val Ser Lys Asn Pro Arg Glu Ala Phe Leu Asn
465 470 475

TAC AGG GAT TTA GAT ATT GGT GTC AAC AGT CAT GGC AAG AAT GCT TAC 1488
Tyr Arg Asp Leu Asp Ile Gly Val Asn Ser His Gly Lys Asn Ala Tyr
480 485 490

GGT GAA GGA ATG GTT TAT GGG CAC AAG TAT TTC AAA GAG ACG AAT TAT 1536
Gly Glu Gly Met Val Tyr Gly His Lys Tyr Phe Lys Glu Thr Asn Tyr
495 500 505 510

AAG AGG CTA ACG ATG GTG AAG ACG AGG GTT GAT CCT AGC AAT TTT TTT 1584
Lys Arg Leu Thr Met Val Lys Thr Arg Val Asp Pro Ser Asn Phe Phe
515 520 525

AGG AAT GAG CAA AGT ATC CCA ACT TTG TCA TCT TCA TGG AAG 1626
Arg Asn Glu Gln Ser Ile Pro Thr Leu Ser Ser Ser Trp Lys
530 535 540

TAAATTCTAA ATTCACTTGT GAAATTGAAT AAAAGTATGG CTTTTTCAAG GTCATGGTAT 1686 

CCAGATTCAG ATGATATTGA TATAATTTTG ACTTGTATTT ATACAAACAA AATTATATTA 1746 

TATTTTTCTG AATTTAGATT TTCCATTCTT TGGAAAAATA TACGAACATT GATGTTGATA 1806

TTTTTAAGAA TTATAGATTT TGAACATTGT GAACAATGAA TAAACCGAGG ACTTCCCTTG 1866 

GGTTTTTTTT ATAAGTATGT AATAGCATGT CTTTAATCAA GATAACCGAT CATTGGATGC 1926 



AATTTATTAT TATAAACCTT ATTTAAAAAA AATUUUUU^AA AAAAAAAAAA AAAAA 1981



(2) INFORMATION FOR SEQ ID NO: 58:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 540 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Met Ala Ile Thr Tyr Ser Phe Asn Phe Lys Ser Tyr Ile Phe Pro Leu
1 5 10 15

Leu Leu Val Leu Leu Ser Thr His Ser Ser Ala Thr Ser Thr Ser Ile
20 25 30

Ile Asp Arg Phe Thr Gln Cys Leu Asn Asn Arg Ala Asp Pro Ser Phe
35 40 45

Pro Leu Ser Gly Gln Leu Tyr Thr Pro Asp Asn Ser Ser Phe Pro Ser
50 55 60

Val Leu Gln Ala Tyr Ile Arg Asn Leu Arg Phe Asn Glu Ser Thr Thr
65 70 75 80

Pro Lys Pro Ile Leu Ile Ile Thr Ala Leu His Pro Ser His Ile Gln
85 90 95

Ala Ala Val Val Cys Ala Lys Thr His Arg Leu Leu Met Lys Thr Arg
100 105 110

Ser Gly Gly His Asp Tyr Glu Gly Leu Ser Tyr Val Thr Asn Ser Asn
115 120 125

Gln Pro Phe Phe Val Val Asp Met Phe Asn Leu Arg Ser Ile Asn Val
130 135 140

Ser Ile Glu Asp Glu Thr Ala Trp Val Gln Ala Gly Ala Thr Leu Gly
145 150 155 160

Glu Val Tyr Tyr Arg Ile Ala Glu Lys Ser Asn Ser His Ala Phe Pro
165 170 175

Ala Gly Val Cys Pro Thr Val Gly Val Gly Gly His Phe Ser Gly Gly
180 185 190

Gly Tyr Gly Asn Leu Met Gly Lys Tyr Gly Leu Ser Val Asp Asn Ile
195 200 205

Val Asp Ala Gln Leu Ile Asp Val Asn Gly Lys Leu Leu Asn Arg Lys
210 215 220

Ser Met Gly Glu Asp Leu Phe Trp Ala Ile Thr Gly Gly Gly Gly Val
225 230 235 240



Ser Phe Gly Val Val Val Ala Tyr Lys Ile Lys Leu Val Arg Val Pro
245 250 255

Thr Thr Val Thr Val Phe Asn Val Gln Arg Thr Ser Glu Gln Asn Leu
260 265 270

Ser Thr Ile Ala His Arg Trp Ile Gln Val Ala Asp Lys Leu Asp Asn
275 280 285

Asp Leu Phe Leu Arg Met Thr Phe Asn Val Ile Asn Asn Thr Asn Gly
290 295 300

Glu Lys Thr Ile Arg Gly Leu Phe Pro Thr Leu Tyr Leu Gly Asn Ser
305 310 315 320

Thr Ala Leu Val Ala Leu Leu Asn Lys Asp Phe Pro Glu Leu Gly Val
325 330 335

Glu Ile Ser Asp Cys Ile Glu Met Ser Trp Ile Glu Ser Val Leu Phe
340 345 350

Tyr Thr Asn Phe Pro Ile Gly Thr Pro Thr Thr Ala Leu Leu Ser Arg
355 360 365

Thr Pro Gln Arg Leu Asn Pro Phe Lys Ile Lys Ser Asp Tyr Val Lys
370 375 380

Asn Thr Ile Ser Lys Gln Gly Phe Glu Ser Ile Phe Glu Arg Met Lys
385 390 395 400

Glu Leu Glu Asn Gln Met Leu Ala Phe Asn Pro Tyr Gly Gly Arg Met
405 410 415

Ser Glu Ile Ser Glu Phe Ala Lys Pro Phe Pro His Arg Ser Gly Asn
420 425 430

Ile Ala Lys Ile Gln Tyr Glu Val Asn Trp Asp Glu Leu Gly Val Glu
435 440 445

Ala Ala Asn Arg Tyr Leu Asn Phe Thr Arg Val Met Tyr Asp Tyr Met
450 455 460

Thr Pro Phe Val Ser Lys Asn Pro Arg Glu Ala Phe Leu Asn Tyr Arg
465 470 475 480

Asp Leu Asp Ile Gly Val Asn Ser His Gly Lys Asn Ala Tyr Gly Glu
485 490 495

Gly Met Val Tyr Gly His Lys Tyr Phe Lys Glu Thr Asn Tyr Lys Arg
500 505 510

Leu Thr Met Val Lys Thr Arg Val Asp Pro Ser Asn Phe Phe Arg Asn
515 520 525

Glu Gln Ser Ile Pro Thr Leu Ser Ser Ser Trp Lys
530 535 540



(2) INFORMATION FOR SEQ ID NO : 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59:
GGTAATGATC TCCTTTCTTG TTTGACC 



(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60 
AGAGCGGCCG CTATATTACA ACTTCTCCAC CATCACTCCT 



(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 61 
GGTGATGTTA ATGATAATCT CCTC 



(2) INFORMATION FOR SEQ ID NO : 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs

(B) TYPE: nucleic acid 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62 
AGAGCGGCCG CTACAATTCC TTCAACATGT AAATTTCCTC 



(2) INFORMATION FOR SEQ ID NO : 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 63 
ACTTCCCGTA GAAACTCGGA GACTTTCACA CAATGC 



(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 
TCCATCCAAG ATCAATTCAT AAACTGTGTC 



(2) INFORMATION FOR SEQ ID NO : 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 65: 
AGAGCGGCCG CTTTCATGAA CCTAGCTTCT AGTAGG 



(2) INFORMATION FOR SEQ ID NO : 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 66: 
AGAGCGGCCG CGAAATGGCC CCCCTTTTAA AACGGGG 



(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67:
AGAGCGGCCG CAAATGATAT CTTCAGGTAA CTTTGTTCAC 



(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
AGAGCGGCCG CATAATCAAA TAAATACACT TATGGTAACA CAG 



(2) INFORMATION FOR SEQ ID NO : 69: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
AGAGCGGCCG CTGGTTTTGT ATTGAGGACT CAAAACAG 



(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1757 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: Colombia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: join(1..570, 801..1754)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

ACT TCC CGT AGA AAC TCG GAG ACT TTC ACA CAA TGC CTA ACC TCA AAC 48
Thr Ser Arg Arg Asn Ser Glu Thr Phe Thr Gln Cys Leu Thr Ser Asn
1 5 10 15

TCC GAC CCC AAA CAT CCC ATC TCC CCC GCT ATC TTC TTC TCC GGA AAT 96
Ser Asp Pro Lys His Pro Ile Ser Pro Ala Ile Phe Phe Ser Gly Asn
20 25 30

GGC TCC TAC TCC TCC GTA TTA CAA GCC AAC ATC CGT AAC CTC CGC TTC 144
Gly Ser Tyr Ser Ser Val Leu Gln Ala Asn Ile Arg Asn Leu Arg Phe
35 40 45

AAC ACC ACC TCA ACT CCG AAA CCC TTC CTC ATA ATC GCC GCA ACA CAT 192
Asn Thr Thr Ser Thr Pro Lys Pro Phe Leu Ile Ile Ala Ala Thr His
50 55 60

GAA TCC CAT GTG CAA GCC GCG ATT ACT TGC GGG AAA CGC CAC AAC CTT 240
Glu Ser His Val Gln Ala Ala Ile Thr Cys Gly Lys Arg His Asn Leu
65 70 75 80

CAG ATG AAA ATC AGA AGT GGA GGC CAC GAC TAC GAT GGC TTG TCA TAC 288
Gln Met Lys Ile Arg Ser Gly Gly His Asp Tyr Asp Gly Leu Ser Tyr
85 90 95

GTT ACA TAC TCT GGC AAA CCG TTC TTC GTC CTC GAC ATG TTT AAC CTC 336
Val Thr Tyr Ser Gly Lys Pro Phe Phe Val Leu Asp Met Phe Asn Leu
100 105 110

CGT TCG GTG GAT GTC GAT GTG GCA AGT AAG ACC GCG TGG GTC CAA ACC 384
Arg Ser Val Asp Val Asp Val Ala Ser Lys Thr Ala Trp Val Gln Thr
115 120 125

GGT GCC ATA CTC GGA GAA GTT TAT TAC TAT ATA TGG GAG AAG AGC AAA 432
Gly Ala Ile Leu Gly Glu Val Tyr Tyr Tyr Ile Trp Glu Lys Ser Lys
130 135 140

ACC CTA GCT TAT CCC GCC GGA ATT TGT CCC ACG GTT GGT GTC GGT GGC 480
Thr Leu Ala Tyr Pro Ala Gly Ile Cys Pro Thr Val Gly Val Gly Gly
145 150 155 160

CAT ATC AGT GGT GGA GGT TAC GGT AAC ATG ATG AGA AAA TAC GGT CTC 528
His Ile Ser Gly Gly Gly Tyr Gly Asn Met Met Arg Lys Tyr Gly Leu
165 170 175

ACC GTA GAT AAT ACC ATC GAT GCA AGA ATG GTC GAC GTT AAT 570
Thr Val Asp Asn Thr Ile Asp Ala Arg Met Val Asp Val Asn
180 185 190

GGTATAATTG ATATCTCTAT TTTATATACT AATTAAATTT TATAGTGTGG ATCGGATAGT 630



GATTTTGGTC CATCAATTAA AAACTTGGTG AACATAAAAT TAACCAAGCA ATCAATTTAG 690 



ACAAGCAACA TAATCATATA TATTTTTCTT ACATTTGTAT GTACCTGAAT ATTTATATTT 750 



ATGTTTATAT GTTCTCACTA TATTTTCACT TTTGTATTTG AAAATTTTTA GGA AAA 806 

Gly Lys 

ATT TTG GAT AGA AAA TTG ATG GGA GAA GAT CTC TAC TGG GCA ATA AAC 854
Ile Leu Asp Arg Lys Leu Met Gly Glu Asp Leu Tyr Trp Ala Ile Asn
195 200 205

GGA GGA GGA GGA GGG AGC TAC GGC GTC GTA TTG GCC TAC AAA ATA AAC 902
Gly Gly Gly Gly Gly Ser Tyr Gly Val Val Leu Ala Tyr Lys Ile Asn
210 215 220

CTT GTT GAA GTC CCA GAA AAC GTC ACC GTT TTC AGA ATC TCC CGG ACG 950
Leu Val Glu Val Pro Glu Asn Val Thr Val Phe Arg Ile Ser Arg Thr
225 230 235 240

TTA GAA CAA AAT GCG ACG GAT ATC ATT CAC CGG TGG CAA CAA GTT GCA 998
Leu Glu Gln Asn Ala Thr Asp Ile Ile His Arg Trp Gln Gln Val Ala
245 250 255

CCG AAG CTT CCC GAC GAG CTT TTC ATA AGA ACA GTC ATT GAC GTA GTA 1046
Pro Lys Leu Pro Asp Glu Leu Phe Ile Arg Thr Val Ile Asp Val Val
260 265 270



AAC GGC ACT GTT TCA TCT CAA AAG ACC GTC AGG ACA ACA TTC ATA GCA 1094
Asn Gly Thr Val Ser Ser Gln Lys Thr Val Arg Thr Thr Phe Ile Ala
275 280 285

ATG TTT CTA GGA GAC ACG ACA ACT CTA CTG TCG ATA TTA AAC CGG AGA 1142
Met Phe Leu Gly Asp Thr Thr Thr Leu Leu Ser Ile Leu Asn Arg Arg
290 295 300

TTC CCA GAA TTG GGT TTG GTC CGG TCT GAC TGT ACC GAA ACA AGC TGG 1190
Phe Pro Glu Leu Gly Leu Val Arg Ser Asp Cys Thr Glu Thr Ser Trp
305 310 315 320

ATC CAA TCT GTG CTA TTC TGG ACA AAT ATC CAA GTT GGT TCG TCG GAG 1238
Ile Gln Ser Val Leu Phe Trp Thr Asn Ile Gln Val Gly Ser Ser Glu
325 330 335

ACA CTT CTA CTC CAA AGG AAT CAA CCC GTG AAC TAC CTC AAG AGG AAA 1286
Thr Leu Leu Leu Gln Arg Asn Gln Pro Val Asn Tyr Leu Lys Arg Lys
340 345 350

TCA GAT TAC GTA CGT GAA CCG ATT TCA AGA ACC GGT TTA GAG TCA ATT 1334
Ser Asp Tyr Val Arg Glu Pro Ile Ser Arg Thr Gly Leu Glu Ser Ile
355 360 365

TGG AAG AAA ATG ATC GAG CTT GAA ATT CCG ACA ATG GCT TTC AAT CCA 1382
Trp Lys Lys Met Ile Glu Leu Glu Ile Pro Thr Met Ala Phe Asn Pro
370 375 380

TAC GGT GGT GAG ATG GGG AGG ATA TCA TTA CGG GTG ACT CCG TTC CCA 1430
Tyr Gly Gly Glu Met Gly Arg Ile Ser Leu Arg Val Thr Pro Phe Pro
385 390 395 400

TAC AGA GCC GGT AAT CTC TGG AAG ATT CAG TAC GGT GCG AAT TGG AGA 1478
Tyr Arg Ala Gly Asn Leu Trp Lys Ile Gln Tyr Gly Ala Asn Trp Arg
405 410 415

GAT GAG ACT TTA ACC GAC CGG TAC ATG GAA TTG ACG AGG AAG TTG TAC 1526
Asp Glu Thr Leu Thr Asp Arg Tyr Met Glu Leu Thr Arg Lys Leu Tyr
420 425 430

CAA TTC ATG ACA CCA TTT GTT TCC AAG AAT CCG AGA CAA TCG TTT TTC 1574
Gln Phe Met Thr Pro Phe Val Ser Lys Asn Pro Arg Gln Ser Phe Phe
435 440 445

AAT AAC CGT GAT GTT GAT TTG GGT ATT AAT TCT CAT AAT GGT AAA ATC 1622
Asn Asn Arg Asp Val Asp Leu Gly Ile Asn Ser His Asn Gly Lys Ile
450 455 460

AGT AGT TAT GTG GAA GGT AAA CGT TAC GGG AAG AAG TAT TTC GCA GGT 1670
Ser Ser Tyr Val Glu Gly Lys Arg Tyr Gly Lys Lys Tyr Phe Ala Gly
465 470 475 480

AAT TTC GAG AGA TTG GTG AAG ATT AAG ACG AGA GTT GAT AGT GGT AAT 1718
Asn Phe Glu Arg Leu Val Lys Ile Lys Thr Arg Val Asp Ser Gly Asn
485 490 495

TTC TTT AGG AAC GAA CAC AGT ATT CCT GTG TTA CCA TAA 1757
Phe Phe Arg Asn Glu His Ser Ile Pro Val Leu Pro
500 505



(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Thr Ser Arg Arg Asn Ser Glu Thr Phe Thr Gln Cys Leu Thr Ser Asn
1 5 10 15

Ser Asp Pro Lys His Pro Ile Ser Pro Ala Ile Phe Phe Ser Gly Asn
20 25 30

Gly Ser Tyr Ser Ser Val Leu Gln Ala Asn Ile Arg Asn Leu Arg Phe
35 40 45

Asn Thr Thr Ser Thr Pro Lys Pro Phe Leu Ile Ile Ala Ala Thr His
50 55 60

Glu Ser His Val Gln Ala Ala Ile Thr Cys Gly Lys Arg His Asn Leu
65 70 75 80

Gln Met Lys Ile Arg Ser Gly Gly His Asp Tyr Asp Gly Leu Ser Tyr
85 90 95

Val Thr Tyr Ser Gly Lys Pro Phe Phe Val Leu Asp Met Phe Asn Leu
100 105 110

Arg Ser Val Asp Val Asp Val Ala Ser Lys Thr Ala Trp Val Gln Thr
115 120 125

Gly Ala Ile Leu Gly Glu Val Tyr Tyr Tyr Ile Trp Glu Lys Ser Lys
130 135 140

Thr Leu Ala Tyr Pro Ala Gly Ile Cys Pro Thr Val Gly Val Gly Gly
145 150 155 160

His Ile Ser Gly Gly Gly Tyr Gly Asn Met Met Arg Lys Tyr Gly Leu
165 170 175

Thr Val Asp Asn Thr Ile Asp Ala Arg Met Val Asp Val Asn Gly Lys
180 185 190

Ile Leu Asp Arg Lys Leu Met Gly Glu Asp Leu Tyr Trp Ala Ile Asn
195 200 205

Gly Gly Gly Gly Gly Ser Tyr Gly Val Val Leu Ala Tyr Lys Ile Asn
210 215 220

Leu Val Glu Val Pro Glu Asn Val Thr Val Phe Arg Ile Ser Arg Thr
225 230 235 240

Leu Glu Gln Asn Ala Thr Asp Ile Ile His Arg Trp Gln Gln Val Ala
245 250 255

Pro Lys Leu Pro Asp Glu Leu Phe Ile Arg Thr Val Ile Asp Val Val
260 265 270

Asn Gly Thr Val Ser Ser Gln Lys Thr Val Arg Thr Thr Phe Ile Ala
275 280 285

Met Phe Leu Gly Asp Thr Thr Thr Leu Leu Ser Ile Leu Asn Arg Arg
290 295 300

Phe Pro Glu Leu Gly Leu Val Arg Ser Asp Cys Thr Glu Thr Ser Trp
305 310 315 320

Ile Gln Ser Val Leu Phe Trp Thr Asn Ile Gln Val Gly Ser Ser Glu
325 330 335

Thr Leu Leu Leu Gln Arg Asn Gln Pro Val Asn Tyr Leu Lys Arg Lys
340 345 350

Ser Asp Tyr Val Arg Glu Pro Ile Ser Arg Thr Gly Leu Glu Ser Ile
355 360 365

Trp Lys Lys Met Ile Glu Leu Glu Ile Pro Thr Met Ala Phe Asn Pro
370 375 380

Tyr Gly Gly Glu Met Gly Arg Ile Ser Leu Arg Val Thr Pro Phe Pro
385 390 395 400

Tyr Arg Ala Gly Asn Leu Trp Lys Ile Gln Tyr Gly Ala Asn Trp Arg
405 410 415

Asp Glu Thr Leu Thr Asp Arg Tyr Met Glu Leu Thr Arg Lys Leu Tyr
420 425 430

Gln Phe Met Thr Pro Phe Val Ser Lys Asn Pro Arg Gln Ser Phe Phe
435 440 445

Asn Asn Arg Asp Val Asp Leu Gly Ile Asn Ser His Asn Gly Lys Ile
450 455 460

Ser Ser Tyr Val Glu Gly Lys Arg Tyr Gly Lys Lys Tyr Phe Ala Gly
465 470 475 480

Asn Phe Glu Arg Leu Val Lys Ile Lys Thr Arg Val Asp Ser Gly Asn
485 490 495

Phe Phe Arg Asn Glu His Ser Ile Pro Val Leu Pro
500 505



(2) INFORMATION FOR SEQ ID NO: 72:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1527 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Arabidopsis thaliana

(B) STRAIN: Colombia

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1524

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72:



ACT TCC CGT AGA AAC TCG GAG ACT TTC ACA CAA TGC CTA ACC TCA AAC 48
Thr Ser Arg Arg Asn Ser Glu Thr Phe Thr Gln Cys Leu Thr Ser Asn
1 5 10 15

TCC GAC CCC AAA CAT CCC ATC TCC CCC GCT ATC TTC TTC TCC GGA AAT 96
Ser Asp Pro Lys His Pro Ile Ser Pro Ala Ile Phe Phe Ser Gly Asn
20 25 30

GGC TCC TAC TCC TCC GTA TTA CAA GCC AAC ATC CGT AAC CTC CGC TTC 144
Gly Ser Tyr Ser Ser Val Leu Gln Ala Asn Ile Arg Asn Leu Arg Phe
35 40 45

AAC ACC ACC TCA ACT CCG AAA CCC TTC CTC ATA ATC GCC GCA ACA CAT 192
Asn Thr Thr Ser Thr Pro Lys Pro Phe Leu Ile Ile Ala Ala Thr His
50 55 60

GAA TCC CAT GTG CAA GCC GCG ATT ACT TGC GGG AAA CGC CAC AAC CTT 240
Glu Ser His Val Gln Ala Ala Ile Thr Cys Gly Lys Arg His Asn Leu
65 70 75 80

CAG ATG AAA ATC AGA AGT GGA GGC CAC GAC TAC GAT GGC TTG TCA TAC 288
Gln Met Lys Ile Arg Ser Gly Gly His Asp Tyr Asp Gly Leu Ser Tyr
85 90 95

GTT ACA TAC TCT GGC AAA CCG TTC TTC GTC CTC GAC ATG TTT AAC CTC 336
Val Thr Tyr Ser Gly Lys Pro Phe Phe Val Leu Asp Met Phe Asn Leu
100 105 110

CGT TCG GTG GAT GTC GAC GTG GCA AGT AAG ACC GCG TGG GTC CAA ACC 384
Arg Ser Val Asp Val Asp Val Ala Ser Lys Thr Ala Trp Val Gln Thr
115 120 125

GGT GCC ATA CTC GGA GAA GTT TAT TAC TAT ATA TGG GAG AAG AGC AAA 432
Gly Ala Ile Leu Gly Glu Val Tyr Tyr Tyr Ile Trp Glu Lys Ser Lys
130 135 140

ACC CTA GCT TAT CCC GCC GGA ATT TGT CCC ACG GTT GGT GTC GGT GGC 480
Thr Leu Ala Tyr Pro Ala Gly Ile Cys Pro Thr Val Gly Val Gly Gly
145 150 155 160

CAT ATC AGT GGT GGA GGT TAC GGT AAC ATG ATG AGA AAA TAC GGT CTC 528
His Ile Ser Gly Gly Gly Tyr Gly Asn Met Met Arg Lys Tyr Gly Leu
165 170 175

ACC GTA GAT AAT ACC ATC GAT GCA AGA ATG GTC GAC GTA AAT GGA AAA 576
Thr Val Asp Asn Thr Ile Asp Ala Arg Met Val Asp Val Asn Gly Lys
180 185 190

ATT TTG GAT AGA AAA TTG ATG GGA GAA GAT CTC TAC TGG GCA ATA AAC 624
Ile Leu Asp Arg Lys Leu Met Gly Glu Asp Leu Tyr Trp Ala Ile Asn
195 200 205

GGA GGA GGA GGA GGG AGC TAC GGC GTC GTA TTG GCC TAC AAA ATA AAC 672
Gly Gly Gly Gly Gly Ser Tyr Gly Val Val Leu Ala Tyr Lys Ile Asn
210 215 220

CTT GTT GAA GTC CCA GAA AAC GTC ACC GTT TTC AGA ATC TCC CGG ACG 720
Leu Val Glu Val Pro Glu Asn Val Thr Val Phe Arg Ile Ser Arg Thr
225 230 235 240

TTA GAA CAA AAT GCG ACG GAT ATC ATT CAC CGG TGG CAA CAA GTT GCA 768
Leu Glu Gln Asn Ala Thr Asp Ile Ile His Arg Trp Gln Gln Val Ala
245 250 255

CCG AAG CTT CCC GAC GAG CTT TTC ATA AGA ACA GTC ATT GAC GTA GTA 816
Pro Lys Leu Pro Asp Glu Leu Phe Ile Arg Thr Val Ile Asp Val Val
260 265 270

AAC GGC ACT GTT TCA TCT CAA AAG ACC GTC AGG ACA ACA TTC ATA GCA 864
Asn Gly Thr Val Ser Ser Gln Lys Thr Val Arg Thr Thr Phe Ile Ala
275 280 285

ATG TTT CTA GGA GAC ACG ACA ACT CTA CTG TCG ATA TTA AAC CGG AGA 912
Met Phe Leu Gly Asp Thr Thr Thr Leu Leu Ser Ile Leu Asn Arg Arg
290 295 300

TTC CCA GAA TTG GGT TTG GTC CGG TCT GAC TGT ACC GAA ACA AGC TGG 960
Phe Pro Glu Leu Gly Leu Val Arg Ser Asp Cys Thr Glu Thr Ser Trp
305 310 315 320

ATC CAA TCT GTG CTA TTC TGG ACA AAT ATC CAA GTT GGT TCG TCG GAG 1008
Ile Gln Ser Val Leu Phe Trp Thr Asn Ile Gln Val Gly Ser Ser Glu
325 330 335

ACA CTT CTA CTC CAA AGG AAT CAA CCC GTG AAC TAC CTC AAG AGG AAA 1056
Thr Leu Leu Leu Gln Arg Asn Gln Pro Val Asn Tyr Leu Lys Arg Lys
340 345 350

TCA GAT TAC GTA CGT GAA CCG ATT TCA AGA ACC GGT TTA GAG TCA ATT 1104
Ser Asp Tyr Val Arg Glu Pro Ile Ser Arg Thr Gly Leu Glu Ser Ile
355 360 365

TGG AAG AAA ATG ATC GAG CTT GAA ATT CCG ACA ATG GCT TTC AAT CCA 1152
Trp Lys Lys Met Ile Glu Leu Glu Ile Pro Thr Met Ala Phe Asn Pro
370 375 380

TAC GGT GGT GAG ATG GGG AGG ATA TCA TCT ACG GTG ACT CCG TTC CCA 1200
Tyr Gly Gly Glu Met Gly Arg Ile Ser Ser Thr Val Thr Pro Phe Pro
385 390 395 400

TAC AGA GCC GGT AAT CTC TGG AAG ATT CAG TAC GGT GCG AAT TGG AGA 1248
Tyr Arg Ala Gly Asn Leu Trp Lys Ile Gln Tyr Gly Ala Asn Trp Arg
405 410 415

GAT GAG ACT TTA ACC GAC CGG TAC ATG GAA TTG ACG AGG AAG TTG TAC 1296
Asp Glu Thr Leu Thr Asp Arg Tyr Met Glu Leu Thr Arg Lys Leu Tyr
420 425 430

CAA TTC ATG ACA CCA TTT GTT TCC AAG AAT CCG AGA CAA TCG TTT TTC 1344
Gln Phe Met Thr Pro Phe Val Ser Lys Asn Pro Arg Gln Ser Phe Phe
435 440 445

AAT TAC CGT GAT GTT GAT TTG GGT ATT AAT TCT CAT AAT GGT AAA ATC 1392
Asn Tyr Arg Asp Val Asp Leu Gly Ile Asn Ser His Asn Gly Lys Ile
450 455 460

AGT AGT TAT GTG GAA GGT AAA CGT TAC GGG AAG AAG TAT TTC GCA GGT 1440
Ser Ser Tyr Val Glu Gly Lys Arg Tyr Gly Lys Lys Tyr Phe Ala Gly
465 470 475 480

AAT TTC GAG AGA TTG GTG AAG ATT AAG ACG AGA GTT GAT AGT GGT AAT 1488
Asn Phe Glu Arg Leu Val Lys Ile Lys Thr Arg Val Asp Ser Gly Asn
485 490 495

TTC TTT AGG AAC GAA CAG AGT ATT CCT GTG TTA CCA TAA 1527
Phe Phe Arg Asn Glu Gln Ser Ile Pro Val Leu Pro
500 505



(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 amino acids 

(B) TYPE: amino acid
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Thr Ser Arg Arg Asn Ser Glu Thr Phe Thr Gln Cys Leu Thr Ser Asn
1 5 10 15

Ser Asp Pro Lys His Pro Ile Ser Pro Ala Ile Phe Phe Ser Gly Asn
20 25 30

Gly Ser Tyr Ser Ser Val Leu Gln Ala Asn Ile Arg Asn Leu Arg Phe
35 40 45

Asn Thr Thr Ser Thr Pro Lys Pro Phe Leu Ile Ile Ala Ala Thr His
50 55 60

Glu Ser His Val Gln Ala Ala Ile Thr Cys Gly Lys Arg His Asn Leu
65 70 75 80

Gln Met Lys Ile Arg Ser Gly Gly His Asp Tyr Asp Gly Leu Ser Tyr
85 90 95

Val Thr Tyr Ser Gly Lys Pro Phe Phe Val Leu Asp Met Phe Asn Leu
100 105 110

Arg Ser Val Asp Val Asp Val Ala Ser Lys Thr Ala Trp Val Gln Thr
115 120 125

Gly Ala Ile Leu Gly Glu Val Tyr Tyr Tyr Ile Trp Glu Lys Ser Lys
130 135 140

Thr Leu Ala Tyr Pro Ala Gly Ile Cys Pro Thr Val Gly Val Gly Gly
145 150 155 160

His Ile Ser Gly Gly Gly Tyr Gly Asn Met Met Arg Lys Tyr Gly Leu
165 170 175

Thr Val Asp Asn Thr Ile Asp Ala Arg Met Val Asp Val Asn Gly Lys
180 185 190

Ile Leu Asp Arg Lys Leu Met Gly Glu Asp Leu Tyr Trp Ala Ile Asn
195 200 205

Gly Gly Gly Gly Gly Ser Tyr Gly Val Val Leu Ala Tyr Lys Ile Asn
210 215 220

Leu Val Glu Val Pro Glu Asn Val Thr Val Phe Arg Ile Ser Arg Thr
225 230 235 240

Leu Glu Gln Asn Ala Thr Asp Ile Ile His Arg Trp Gln Gln Val Ala
245 250 255

Pro Lys Leu Pro Asp Glu Leu Phe Ile Arg Thr Val Ile Asp Val Val
260 265 270

Asn Gly Thr Val Ser Ser Gln Lys Thr Val Arg Thr Thr Phe Ile Ala
275 280 285

Met Phe Leu Gly Asp Thr Thr Thr Leu Leu Ser Ile Leu Asn Arg Arg
290 295 300

Phe Pro Glu Leu Gly Leu Val Arg Ser Asp Cys Thr Glu Thr Ser Trp
305 310 315 320

Ile Gln Ser Val Leu Phe Trp Thr Asn Ile Gln Val Gly Ser Ser Glu
325 330 335

Thr Leu Leu Leu Gln Arg Asn Gln Pro Val Asn Tyr Leu Lys Arg Lys
340 345 350

Ser Asp Tyr Val Arg Glu Pro Ile Ser Arg Thr Gly Leu Glu Ser Ile
355 360 365

Trp Lys Lys Met Ile Glu Leu Glu Ile Pro Thr Met Ala Phe Asn Pro
370 375 380



Tyr Gly Gly Glu Met Gly Arg Ile Ser Ser Thr Val Thr Pro Phe Pro
385 390 395 400

Tyr Arg Ala Gly Asn Leu Trp Lys Ile Gln Tyr Gly Ala Asn Trp Arg
405 410 415

Asp Glu Thr Leu Thr Asp Arg Tyr Met Glu Leu Thr Arg Lys Leu Tyr
420 425 430

Gln Phe Met Thr Pro Phe Val Ser Lys Asn Pro Arg Gln Ser Phe Phe
435 440 445

Asn Tyr Arg Asp Val Asp Leu Gly Ile Asn Ser His Asn Gly Lys Ile
450 455 460

Ser Ser Tyr Val Glu Gly Lys Arg Tyr Gly Lys Lys Tyr Phe Ala Gly
465 470 475 480

Asn Phe Glu Arg Leu Val Lys Ile Lys Thr Arg Val Asp Ser Gly Asn
485 490 495

Phe Phe Arg Asn Glu Gln Ser Ile Pro Val Leu Pro
500 505



(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1530 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: Colombia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..1527 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

TCC ATC CAA GAT CAA TTC ATA AAC TGT GTC AAA AGA AAC ACA CAT GTT 48
Ser Ile Gln Asp Gln Phe Ile Asn Cys Val Lys Arg Asn Thr His Val
1 5 10 15

TCT TTT CCA CTC GAG AAA ACG TTA TTC ACC CCT GCG AAA AAC GTC TCT 96
Ser Phe Pro Leu Glu Lys Thr Leu Phe Thr Pro Ala Lys Asn Val Ser
20 25 30

TTG TTC AAC CAA GTC CTT GAA TCG ACG GCT CAA AAT CTC CAG TTC TTG 144
Leu Phe Asn Gln Val Leu Glu Ser Thr Ala Gln Asn Leu Gln Phe Leu
35 40 45

GCA AAA TCC ATG CCT AAA CCG GGA TTC ATA TTC AGA CCG ATT CAC CAG 192
Ala Lys Ser Met Pro Lys Pro Gly Phe Ile Phe Arg Pro Ile His Gln
50 55 60

TCT CAA GTC CAA GCT TCC ATC ATT TGT TCA AAG AAA CTC GGA ATT CAT 240
Ser Gln Val Gln Ala Ser Ile Ile Cys Ser Lys Lys Leu Gly Ile His
65 70 75 80

TTT CGT GTT AGA AGT GGC GGT CAC GAT TTC GAG GCC TTG TCT TAT GTT 288
Phe Arg Val Arg Ser Gly Gly His Asp Phe Glu Ala Leu Ser Tyr Val
85 90 95

TCA CGG ATT GAA AAA CCG TTT ATA TTA CTC GAC CTG TCA AAA TTG AAA 336
Ser Arg Ile Glu Lys Pro Phe Ile Leu Leu Asp Leu Ser Lys Leu Lys
100 105 110

CAA ATC AAT GTT GAT ATT GAA TCC AAT AGT GCT TGG GTT CAA CCT GGT 384
Gln Ile Asn Val Asp Ile Glu Ser Asn Ser Ala Trp Val Gln Pro Gly
115 120 125

GCT ACG CTT GGT GAG CTT TAC TAC AGA ATT GCA GAG AAG AGC AAG ATC 432
Ala Thr Leu Gly Glu Leu Tyr Tyr Arg Ile Ala Glu Lys Ser Lys Ile
130 135 140

CAT GGA TTT CCC GCG GGT TTG TGC ACA AGT GTA GGC ATA GGT GGG TAT 480
His Gly Phe Pro Ala Gly Leu Cys Thr Ser Val Gly Ile Gly Gly Tyr
145 150 155 160

ATG ACA GGC GGT GGA TAC GGT ACC TTG ATG AGG AAG TAT GGT CTT GCG 528
Met Thr Gly Gly Gly Tyr Gly Thr Leu Met Arg Lys Tyr Gly Leu Ala
165 170 175

GGA GAT AAT GTT CTA GAC GTA AAG ATG GTT GAT GCA AAT GGT AAA TTA 576
Gly Asp Asn Val Leu Asp Val Lys Met Val Asp Ala Asn Gly Lys Leu
180 185 190

CTC GAC AGA GCC GCG ATG GGT GAG GAC CTA TTT TGG GCG ATT AGA GGA 624
Leu Asp Arg Ala Ala Met Gly Glu Asp Leu Phe Trp Ala Ile Arg Gly
195 200 205

GGC GGT GGA GCG AGT TTC GGG ATA GTT CTA GCA TGG AAG ATC AAG CTT 672
Gly Gly Gly Ala Ser Phe Gly Ile Val Leu Ala Trp Lys Ile Lys Leu
210 215 220

GTT CCT GTT CCT AAG ACT GTT ACC GTC TTC ACT GTC ACC AAA ACG TTA 720
Val Pro Val Pro Lys Thr Val Thr Val Phe Thr Val Thr Lys Thr Leu
225 230 235 240

GAA CAA GAC GCA AGA TTG AAG ACT ATT TCT AAG TGG CAA CAA ATT TCA 768
Glu Gln Asp Ala Arg Leu Lys Thr Ile Ser Lys Trp Gln Gln Ile Ser
245 250 255



TCC AAG ATT ATT GAA GAG ATA CAC ATC CGA GTG GTA CTC AGA GCA GCT 816
Ser Lys Ile Ile Glu Glu Ile His Ile Arg Val Val Leu Arg Ala Ala
260 265 270

GGA AAT GAT GGA AAC AAG ACT GTG ACA ATG ACC TAC CTA GGT CAG TTT 864
Gly Asn Asp Gly Asn Lys Thr Val Thr Met Thr Tyr Leu Gly Gln Phe
275 280 285

CTT GGC GAG AAA GGC ACC TTG CTG AAG GTT ATG GAG AAG GCT TTT CCA 912
Leu Gly Glu Lys Gly Thr Leu Leu Lys Val Met Glu Lys Ala Phe Pro
290 295 300

GAA CTA GGG TTA ACT CAA AAG GAT TGT ACT GAA ATG AGC TGG ATT GAA 960
Glu Leu Gly Leu Thr Gln Lys Asp Cys Thr Glu Met Ser Trp Ile Glu
305 310 315 320

GCC GCC CTT TTC CAT GGT GGA TTT CCA ACA GGT TCT CCT ATT GAA ATT 1008
Ala Ala Leu Phe His Gly Gly Phe Pro Thr Gly Ser Pro Ile Glu Ile
325 330 335

TTG CTT CAG CTC AAG TCG CCT CTA GGA AAA GAT TAC TTC AAA GCA ACG 1056
Leu Leu Gln Leu Lys Ser Pro Leu Gly Lys Asp Tyr Phe Lys Ala Thr
340 345 350

TCG GAT TTC GTT AAA GAA CCT ATT CCT GTG ATA GGC TTC AAA GGA ATA 1104
Ser Asp Phe Val Lys Glu Pro Ile Pro Val Ile Gly Phe Lys Gly Ile
355 360 365

TTC AAA AGA TTG ATT GAA GGA AAC ACA ACA TTT CTG AAC TGG ACT CCT 1152
Phe Lys Arg Leu Ile Glu Gly Asn Thr Thr Phe Leu Asn Trp Thr Pro
370 375 380

TAC GGT GGT ATG ATG TCG AAA ATC CCT GAA TCT GCG ATC CCA TTT CCG 1200
Tyr Gly Gly Met Met Ser Lys Ile Pro Glu Ser Ala Ile Pro Phe Pro
385 390 395 400

CAT AGA AAC GGA ACC CTC TTC AAG ATT CTC TAT TAC GCG AAC TGG CTA 1248
His Arg Asn Gly Thr Leu Phe Lys Ile Leu Tyr Tyr Ala Asn Trp Leu
405 410 415

GAG AAT GAC AAG ACA TCG AGT AGA AAA ATC AAC TGG ATC AAA GAG ATA 1296
Glu Asn Asp Lys Thr Ser Ser Arg Lys Ile Asn Trp Ile Lys Glu Ile
420 425 430

TAC AAT TAC ATG GCG CCT TAT GTC TCA AGC AAT CCA AGA CAA GCA TAT 1344
Tyr Asn Tyr Met Ala Pro Tyr Val Ser Ser Asn Pro Arg Gln Ala Tyr
435 440 445

GTG AAC TAC AGA GAT CTA GAC TTC GGA CAG AAC AAG AAC AAC GCA AAG 1392
Val Asn Tyr Arg Asp Leu Asp Phe Gly Gln Asn Lys Asn Asn Ala Lys
450 455 460

GTT AAC TTC ATT GAA GCT AAA ATC TGG GGA CCT AAG TAC TTC AAA GGC 1440
Val Asn Phe Ile Glu Ala Lys Ile Trp Gly Pro Lys Tyr Phe Lys Gly
465 470 475 480

AAT TTT GAC AGA TTG GTG AAG ATT AAA ACC AAG GTT GAT CCA GAG AAC 1488
Asn Phe Asp Arg Leu Val Lys Ile Lys Thr Lys Val Asp Pro Glu Asn
485 490 495

TTC TTC AGG CAC GAG CAG AGT ATC CCA CCT ATG CCC TAC TAG 1530
Phe Phe Arg His Glu Gln Ser Ile Pro Pro Met Pro Tyr
500 505



(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 509 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

Ser Ile Gln Asp Gln Phe Ile Asn Cys Val Lys Arg Asn Thr His Val
1 5 10 15

Ser Phe Pro Leu Glu Lys Thr Leu Phe Thr Pro Ala Lys Asn Val Ser
20 25 30

Leu Phe Asn Gln Val Leu Glu Ser Thr Ala Gln Asn Leu Gln Phe Leu
35 40 45

Ala Lys Ser Met Pro Lys Pro Gly Phe Ile Phe Arg Pro Ile His Gln
50 55 60

Ser Gln Val Gln Ala Ser Ile Ile Cys Ser Lys Lys Leu Gly Ile His
65 70 75 80

Phe Arg Val Arg Ser Gly Gly His Asp Phe Glu Ala Leu Ser Tyr Val
85 90 95

Ser Arg Ile Glu Lys Pro Phe Ile Leu Leu Asp Leu Ser Lys Leu Lys
100 105 110

Gln Ile Asn Val Asp Ile Glu Ser Asn Ser Ala Trp Val Gln Pro Gly
115 120 125

Ala Thr Leu Gly Glu Leu Tyr Tyr Arg Ile Ala Glu Lys Ser Lys Ile
130 135 140

His Gly Phe Pro Ala Gly Leu Cys Thr Ser Val Gly Ile Gly Gly Tyr
145 150 155 160

Met Thr Gly Gly Gly Tyr Gly Thr Leu Met Arg Lys Tyr Gly Leu Ala
165 170 175

Gly Asp Asn Val Leu Asp Val Lys Met Val Asp Ala Asn Gly Lys Leu
180 185 190

Leu Asp Arg Ala Ala Met Gly Glu Asp Leu Phe Trp Ala Ile Arg Gly
195 200 205

Gly Gly Gly Ala Ser Phe Gly Ile Val Leu Ala Trp Lys Ile Lys Leu
210 215 220

Val Pro Val Pro Lys Thr Val Thr Val Phe Thr Val Thr Lys Thr Leu
225 230 235 240

Glu Gln Asp Ala Arg Leu Lys Thr Ile Ser Lys Trp Gln Gln Ile Ser
245 250 255

Ser Lys Ile Ile Glu Glu Ile His Ile Arg Val Val Leu Arg Ala Ala
260 265 270

Gly Asn Asp Gly Asn Lys Thr Val Thr Met Thr Tyr Leu Gly Gln Phe
275 280 285

Leu Gly Glu Lys Gly Thr Leu Leu Lys Val Met Glu Lys Ala Phe Pro
290 295 300

Glu Leu Gly Leu Thr Gln Lys Asp Cys Thr Glu Met Ser Trp Ile Glu
305 310 315 320

Ala Ala Leu Phe His Gly Gly Phe Pro Thr Gly Ser Pro Ile Glu Ile
325 330 335

Leu Leu Gln Leu Lys Ser Pro Leu Gly Lys Asp Tyr Phe Lys Ala Thr
340 345 350

Ser Asp Phe Val Lys Glu Pro Ile Pro Val Ile Gly Phe Lys Gly Ile
355 360 365

Phe Lys Arg Leu Ile Glu Gly Asn Thr Thr Phe Leu Asn Trp Thr Pro
370 375 380

Tyr Gly Gly Met Met Ser Lys Ile Pro Glu Ser Ala Ile Pro Phe Pro
385 390 395 400

His Arg Asn Gly Thr Leu Phe Lys Ile Leu Tyr Tyr Ala Asn Trp Leu
405 410 415

Glu Asn Asp Lys Thr Ser Ser Arg Lys Ile Asn Trp Ile Lys Glu Ile
420 425 430

Tyr Asn Tyr Met Ala Pro Tyr Val Ser Ser Asn Pro Arg Gln Ala Tyr
435 440 445

Val Asn Tyr Arg Asp Leu Asp Phe Gly Gln Asn Lys Asn Asn Ala Lys
450 455 460

Val Asn Phe Ile Glu Ala Lys Ile Trp Gly Pro Lys Tyr Phe Lys Gly
465 470 475 480

Asn Phe Asp Arg Leu Val Lys Ile Lys Thr Lys Val Asp Pro Glu Asn
485 490 495

Phe Phe Arg His Glu Gln Ser Ile Pro Pro Met Pro Tyr
500 505



